(54) Title: PROCESS FOR EXPRESSION AND SECRETION OF PROTEINS BY THE NON-CONVENTIONAL YEAST
ZYGOSACCHAROMYCES BAILII

(57) Abstract: Herein is disclosed a method for the production of proteins. The protein is expressed by a yeast belonging to the
species Zygosaccharomyces bailii. The yeast secretes the protein produced into the culture medium, from where it is isolated, thereby
simplifying the isolation process. Preferably the yeast is cultivated in chemically defined medium, thereby further simplifying the
isolation process significantly.
High-level production of proteins from engineered organisms (recombinant,
mutagenised, ...) provides an alternative to the extraction of the proteins from
natural sources. Natural sources of proteins are often limited, and furthermore the
concentration of the desired product is generally low, so extraction is regularly
very cost-intensive or even impossible. Besides, extraction might bear the danger
of toxic or infectious contamination depending on the natural origin of the protein.

With the advent of molecular cloning in the mid-70s, it became possible to
produce foreign proteins in new hosts. Recombinant DNA (rDNA) technologies
(genetic, protein and metabolic engineering) allow the production of a wide range
of peptides and proteins from naturally non-producing cells. In fact the first
biotech products on the world market made by means of rDNA were
pharmaceutical products (for example insulin, interferons, erythropoietin, vaccine
against hepatitis B) and industrial enzymes (for example used for the treatment of
food, feed, detergents, paper pulp and health care). World sales of the top 20
recombinant pharmaceutical products in 2000 were about 13 billion Euro, while
the worldwide market for industrial enzymes was about 2.0 and is projected
to reach about 8 billion Euro in 2008.

Microorganisms as well as cultured cells from higher organisms (such as
mammals, insects or plants) represent the main conceivable hosts for the
production of heterologous as well as homologous proteins.

Several processes using mammalian cell culture for the production of proteins
have been developed, and many proteins produced in this manner are on the
market. Among them are several vaccines, monoclonal antibodies, interferon, blood
factors, urokinase and tPA, hormones and growth factors.

The main advantage of a mammalian cell based expression system is the ability of
mammalian cells to process the proteins in a proper way (correct folding,
appropriate post-translational modification, correct glycosylation, specific
proteolytic activities, etc.). A cloned protein expressed from recombinant DNA of
mammalian origin (human) is usually correctly processed and folded and
commonly secreted into the medium, allowing a fast recovery and purification.
On the other hand the costs of production are generally quite high due to a usually
low level of expression, costs of the mammalian medium components, very slow
growth rates and demanding culture conditions. Furthermore, production in
mammalian cells bears the danger of toxic or infectious contamination of the
product.

Microorganisms (prokaryotic as well as eukaryotic) are advantageous hosts for the
production of proteins because of high growth rates and generally easy genetic
manipulation. Bacterial hosts in particular, however, lack the ability to process
proteins correctly, and in many cases heterologously produced proteins form
inclusion bodies inside the bacterial cells, whereupon the proteins are lost,
because their enzymatic activity can in most instances not be reconstituted. Due to
their incorrect structure any use of such proteins for the treatment of humans is
also excluded.

Yeast hosts can combine the advantages of unicellular organisms (i.e., ease of
genetic manipulation and growth) with the capability for protein processing
typical of eukaryotic organisms (i.e., protein folding, assembly and
post-translational modifications), together with the absence of endotoxins as well as
oncogenic or viral DNA. Starting from the early 80s, the majority of recombinant
proteins produced in yeast have been expressed using Saccharomyces cerevisiae
(Hitzeman, R. A. et al., 1981, Nature 293, 717-22). The choice was determined by
the familiarity of molecular biologists with this yeast together with the accumulated
knowledge about its genetics and physiology. Furthermore, S. cerevisiae is an
organism generally regarded as safe (GRAS). However, this yeast is not an
optimal host for the large-scale production of foreign proteins, especially due to its
characteristics regarding fermentation needs. In particular, growth of S. cerevisiae
shows a very pronounced Crabtree effect; therefore fed-batch fermentation is
required to attain high cell densities (see for example Porro, D., et al., 1991, Res.
Microbiol. 142, 535-9). Furthermore, this yeast is comparatively sensitive
regarding the culture conditions, for example regarding the pH value and the
temperature. Therefore, its cultivation is complicated and requires highly
sophisticated equipment. In addition, the proteins produced by S. cerevisiae are
often hyper-glycosylated and retention of the products within the periplasmic
space is frequently observed (Reiser, J. et al., 1990, Adv. Biochem. Eng./Biophys.
43, 75-102 and Romanos, M. A. et al., 1992, Yeast 8, 423-88). Furthermore, due
to the partial retention of the protein in S. cerevisiae, a fraction of the protein is
commonly degraded. These degradation products are generally very
difficult to remove from the desired product. Disadvantages such as these have
promoted a search for alternative hosts, trying to exploit the great biodiversity
existing among the yeasts, and starting the development of expression systems in
the so-called "non-conventional" yeasts. Prominent examples are Hansenula
polymorpha (Buckholz, R. G. et al., 1991, Bio/Technology 9, 1067-72); Pichia
pastoris (Fleer, R., 1992, Curr. Opin. Biotechnol. 3, 486-96); Kluyveromyces
lactis (Gellissen, G. et al., 1997, Gene 190, 87-97); Yarrowia lipolytica (Muller,
S. et al., 1998, Yeast 14, 1267-83) among others. Another yeast genus under
investigation is the genus Zygosaccharomyces. Eleven species, which appear to be
evolutionarily quite close to S. cerevisiae and not so far from K. lactis, have been
classified so far (James, S. A. et al., 1994, Yeast 10, 871-81; Steels, H., et al.,
1999, Int. J. Syst. Bacteriol. 49, 319-27 and Kurtzman, C. P., et al., 2001, FEMS
Yeast Research 1, 133-8). An exceptional resistance to several stresses renders
some of the Zygosaccharomyces species potentially interesting for industrial
purposes. For example Z. rouxii is known to be salt tolerant (osmophilic) and Z.
bailii is known to tolerate high sugar concentrations and acidic environments as
well as relatively high temperatures of growth (Makdesi, A. K. et al., 1996, Int. J.
Food Microbiol. 33, 169-81 and Sousa, M. J. et al., 1996, Appl. Environm.
Microbiol. 62, 3152-7). However, the data available related to the molecular
biology of these yeasts are very poor. While expression and secretion of a
heterologous protein could be achieved in Z. rouxii (Ogawa, Y. et al., 1990, Agric.
Biol. Chem. 54, 2521-9), for Z. bailii just the first molecular tools to successfully
transform this yeast and to express heterologous proteins intracellularly have been
developed (WO 00/41477). Since purification of intracellular proteins is very
elaborate, the use of this host for industrial production processes remains limited.

Furthermore, while many such non-conventional yeasts show specific
advantages regarding their cultivation requirements, these advantages are often
offset by unexpected negative characteristics or unsolvable
problems in their handling. In many instances the tools for transformation of the
organisms or expression of heterologous genes are not developed, or the
development fails due to unfavourable natural properties of the organism in
question. The secretory capabilities often impose further problems for the
production of proteins on an industrial scale. If the organism does not allow the
efficient secretion of the desired protein, the isolation of the product is
significantly complicated. In addition, some very interesting products, such as
interleukin-1β, turned out to be toxic for the cells as long as they are
located intracellularly (Fleer, R. et al., 1991, Gene 107, 285-95). Production of
such proteins is therefore only possible if the host comprises a highly potent
secretory system that can be exploited. Another problem comes from a potentially
different codon usage or codon frequency that can decisively hamper the expression of
heterologous genes in such organisms.
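
The codon-usage concern raised above can be made concrete with a small calculation. The following Python sketch is illustrative only and is not taken from the disclosure: the function names `codon_frequencies` and `rare_codons`, the threshold of 10 per 1000 codons and any host frequency table passed in are assumptions. It counts how often each codon occurs in a coding sequence and reports those codons that a given host table lists as infrequent.

```python
from collections import Counter

def codon_frequencies(cds):
    """Return the frequency per 1000 codons for an in-frame coding sequence."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    counts = Counter(codons)
    total = len(codons)
    return {codon: 1000.0 * n / total for codon, n in counts.items()}

def rare_codons(cds, host_freq, threshold=10.0):
    """List codons used in `cds` whose host frequency (per 1000) is below `threshold`.

    `host_freq` is a hypothetical codon table for the host; codons missing
    from the table are treated as frequency 0 and therefore reported.
    """
    used = codon_frequencies(cds)
    return sorted(c for c in used if host_freq.get(c, 0.0) < threshold)
```

A gene whose coding sequence relies heavily on codons returned by `rare_codons` would be a candidate for the expression problems described in the paragraph above; the actual codon table of any given yeast would have to be supplied by the user.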

In consideration of the state of the art, the problem to be solved by the present
invention was to provide a new, easy and economical method for the production
of proteins. Apart from being cost-effective, the method should be easy to perform
and allow the production of highly pure proteins in a high yield.

This problem, as well as all further problems not explicitly mentioned that are
easily deduced from the introductory discussion, is solved by the objects
outlined in the claims of the instant invention.

An advantageous process for the production of a protein is provided by a method
as outlined in claim one. This method comprises culturing a Zygosaccharomyces
bailii strain expressing and secreting the protein and isolating the protein. This
process is particularly advantageous in that Z. bailii can be cultured with good yield in a
chemically defined medium without the addition of complex ingredients that have
to be separated tediously from the protein produced. Surprisingly, the secretory
capacity of this yeast in chemically defined medium is significantly superior to the
secretory capacity of S. cerevisiae under identical conditions. A further important
advantage is the surprising fact that the protein produced by Z. bailii is not only
readily secreted but also secreted nearly to completion, which is not the case for S. cerevisiae
under identical conditions. Owing to the efficient secretion of the desired protein by Z.
bailii, no degradation of the protein takes place either. Consequently, the purification
of the product is significantly simplified.

Further major advantages of Z. bailii as host organism for protein production, and
in particular for production of heterologous proteins, are a naturally favourable
codon usage as deduced from the examples presented herein and the
comparatively low demands on the culture conditions. This is in particular due to
a high acid and temperature tolerance as well as a weak Crabtree effect allowing
the cultivation with a high sugar concentration from the beginning (i.e. batch
instead of fed-batch cultivation) and the omission of extremely sophisticated
regulation of the culture conditions such as temperature or pH. Accordingly, this
method allows a cost-effective production of proteins in an easy way even on an
industrial scale, yielding proteins of high purity.

The term "expression" of a protein by a host cell is well known to the skilled
artisan. Usually expression of a protein comprises transcription of a DNA
sequence into an mRNA sequence followed by translation of the mRNA sequence
into the protein. A more detailed description of the process can be found for
example in Knippers, R. et al., 1990, Molekulare Genetik, Chapter 3, Georg
Thieme Verlag, Stuttgart.

The term "secretion" of a protein as known in the art means translocation of the
protein produced from inside the cell to outside the cell, thereby
accumulating the protein in the culture medium. A more detailed description of
the process can be found for example in Stryer, L., 1991, Biochemie, Chapter 31,
Spektrum Akad. Verlag, Heidelberg, Berlin, New York.

The protein produced might be any protein known in the art for which an
industrial production is desirable. For example the protein might be useful in the
pharmaceutical field, such as a medication or vaccine or in pre-clinical or clinical
trials among others (examples are growth hormones, tissue plasminogen activator,
hepatitis B vaccine, interferons, erythropoietin). The protein produced might also
be useful in industry, for example in the area of food production (e.g.
β-galactosidase, chymosin, amylases, glucoamylase, amylo-glucosidase, invertase)
or textile and paper production (proteases, amylases, cellulases, lipases, catalases,
etc.). Enzymes are useful among others in detergents (proteases, lipases and
surfactants) and their characteristics of stereo-specificity are furthermore
exploitable in a wide number of bioconversions, yielding a desired chiral
compound. Another promising application of recombinant enzymes that can be
produced by the method of the instant invention is the development of biosensors.

The proteins secreted can vary greatly in size (molecular weight). The herein
described method functions well for very small proteins (e.g. IL-1β, 17 kDa, see
Fig. 5), but also for quite large proteins (e.g. GAA, 67.5 kDa, see Fig. 8a). The
secreted proteins may or may not comprise consensus sites for glycosylation. Such
consensus sites might occur naturally or might be introduced by genetic
engineering. Depending on the intended use of the protein produced it might also
be advantageous to remove naturally occurring consensus sites for glycosylation
by genetic engineering, thereby preventing for example hyper-glycosylation of the
protein. Remarkably, the herein described method leads to proteins that conserve
their desired catalytic characteristics after the secretion (e.g. GAA, see Fig. 8a).

In one embodiment of the present invention the Z. bailii strain is transformed with
a vector comprising a DNA sequence coding for the protein, functionally linked to
a signal sequence leading to the secretion of the protein and further functionally
linked to a promoter leading to the expression of the protein.

The term "vector" refers to any agent such as a plasmid, cosmid, virus, phage, or
linear or circular single-stranded or double-stranded DNA or RNA molecule,
derived from any source, that carries nucleic acid sequences into a host cell.
Preferably a vector is capable of genomic integration or autonomous replication.
Such a vector is capable of introducing a 5' regulatory sequence or promoter
region and a DNA sequence for a selected gene product into a cell in such a
manner that the DNA sequence is transcribed into a functional mRNA, which may
or may not be translated and therefore expressed. Preferably the vector is an
extrachromosomal plasmid. Such a plasmid preferably comprises an autonomously
replicating sequence (ARS) and advantageously a centromeric sequence (CEN) in
addition. More preferably the plasmid is a 2μ-like episomal multicopy plasmid.
Even more preferably the plasmid is derived from an endogenous episomal
plasmid from a Z. bailii strain such as pSB2 (Utatsu, I. et al., 1987, J. Bacteriol.
169, 5537-45) and more preferably from pZB1 or pZB5 (see Fig. 9).

The plasmid pZB5 was extracted from NCYC 1427 and partially sequenced.
Accordingly, the plasmid comprises preferably at least 35, more preferably at least
55, even more preferably at least 75 and most preferably at least 100
bases from at least one of the sequences selected from the list of SEQ ID No.: 63,
SEQ ID No.: 64, SEQ ID No.: 65, SEQ ID No.: 66, SEQ ID No.: 67, SEQ ID No.:
68, SEQ ID No.: 69, SEQ ID No.: 70 or SEQ ID No.: 71.

Yeast multicopy plasmids (also referred to as 2μ or 2μm-like plasmids) isolated
from different yeast genera or species usually show a well conserved structural
homology while having a low sequence homology. Some regulatory elements
were identified as necessary and sufficient to build a functional multicopy
plasmid. These are:

the recombinase promoting amplification of these plasmids, encoded by the FLP
gene (Blanc H., et al., 1979, Mol. Gen. Genet. 176, 335-42 and Broach J. R. et al.,
1980, Cell 21, 501-8);

two inverted repeats (IR-sequences);

a single origin of replication (ARS) at the junction between an internal repeat and a
unique region of the plasmid (Broach J. R. et al., 1980, Cell 21, 501-8; Brewer B.
J. et al., 1987, Cell 51, 463-71; McNeil J. B., et al., 1980, Curr. Genet. 2, 17-25)
and

the regulatory proteins REP1/REP2 (in Z. bailii referred to as TFB/TFC),
controlling the amplification process by limiting the recombinase activity in the
cell through repression of FLP gene expression (Broach J. R. et al.,
1980, Cell 21, 501-8; Jayaram M. et al., 1983, Cell 34, 95-104).

Within the scope of the instant invention these key elements of the 2μ plasmid are
preferably derived from Z. bailii, even more preferably from Z. bailii NCYC 1427
or ATCC 36947. Particularly preferably these sequences correspond to SEQ ID No.:
71 (IR-ARS), SEQ ID No.: 72 (FLP), SEQ ID No.: 74 (TFB) and SEQ ID No.: 76
(TFC), respectively. The expressed recombinase and the expressed regulatory
proteins preferably exhibit the amino acid sequence shown in SEQ ID No.: 73
(FLP), SEQ ID No.: 75 (TFB) and SEQ ID No.: 77 (TFC), respectively. Preferably
the plasmid additionally comprises the homologous upstream regions of the FLP
and the TFB/TFC genes, in order to obtain an optimal control of the transcription
level.

Generally speaking the plasmid preferably comprises sequences for (autonomous) 
replication, stabilization and/or plasmid copy number control, obtainable from a Z. 
bailii strain. 

Preferably the plasmid is pEZ1 (see Fig. 9c).

Particularly preferred is the plasmid pEZ2 (see Fig. 9d). One preferred way to
construct pEZ2 is to amplify the IR/ARS region and the TFC/FLP genes including
their homologous promoters by PCR with the oligos

5'-AGAATCAATCATTTAGTGTGGCAGGAG-3' (SEQ ID NO.: 90) and

5'-TAAAAACTGCCCGCCATATTTCGTC-3' (SEQ ID NO.: 91, IR/ARS),

5'-AGAATGAACTCAGAGTTCTCTCTTG-3' (SEQ ID NO.: 86) and

5'-CCTATGTCCGAGTTTAGCGAGCTTG-3' (SEQ ID NO.: 85, FLP/TFC)

and to substitute the ARS/CEN cassette from pZ3 with these amplified products.
Another way is to substitute the 2μ-ori sequence from the plasmid p195 with the
aforementioned PCR products.
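
For orientation, basic properties of the four oligos named above can be computed with a few lines of Python. This is a hedged sketch, not part of the disclosed method: the helper names are hypothetical, the sequences are copied from the text with spacing removed, and the Wallace rule (2 °C per A/T, 4 °C per G/C) is only a rough estimate that is normally applied to short oligos.

```python
# Hypothetical helpers for inspecting PCR oligos; names are illustrative.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))

def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)

def wallace_tm(seq):
    """Rough melting temperature in degrees C by the Wallace rule."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc

# Oligo sequences as given in the text (5' to 3'), keyed by SEQ ID NO.
OLIGOS = {
    "SEQ ID NO.: 90": "AGAATCAATCATTTAGTGTGGCAGGAG",
    "SEQ ID NO.: 91": "TAAAAACTGCCCGCCATATTTCGTC",
    "SEQ ID NO.: 86": "AGAATGAACTCAGAGTTCTCTCTTG",
    "SEQ ID NO.: 85": "CCTATGTCCGAGTTTAGCGAGCTTG",
}

def summarize(oligos):
    """Return (name, length, GC fraction, Wallace Tm) for each oligo."""
    return [(name, len(seq), round(gc_content(seq), 2), wallace_tm(seq))
            for name, seq in oligos.items()]

if __name__ == "__main__":
    for row in summarize(OLIGOS):
        print(row)
```

Such a summary is useful mainly as a sanity check that the two oligos of a pair have comparable length and composition before a PCR is set up.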

Advantageously, the vector comprises a selectable marker. The term selectable
marker refers to a nucleic acid sequence whose expression confers a phenotype
facilitating identification of cells containing the nucleic acid sequence. Selectable
markers include those which confer resistance to toxic chemicals (= dominant
marker, e.g. G418, hygromycin, formaldehyde, phleomycin or fluoroacetate, as
reviewed in Van den Berg, M. et al., 1997, Yeast 13, 551-9) or complement an
auxotrophy (= auxotrophic marker, e.g. uracil, histidine, leucine, tryptophan).
Auxotrophic selection markers can be used for naturally auxotrophic Z. bailii
strains or strains that have been rendered auxotrophic by genetic manipulation,
in particular by (partial) deletion or mutagenisation of an essential gene, e.g. HIS3
(Branduardi, P., 2002, Yeast 19, 1165-70). As complementing marker sequence
the homologous gene from Z. bailii or a heterologous gene might be employed.
Auxotrophic markers are preferred since no component has to be added to the
medium to keep the selective pressure during the cultivation.

The term "promoter" or "promoter region" refers to a DNA sequence, usually
found upstream (5') to a coding sequence, that controls expression of the coding
sequence by controlling production of messenger RNA (mRNA) by providing the
recognition site for RNA polymerase and/or other factors necessary for start of
transcription at the correct site. The promoter can be derived from any organism.
Preferably the promoter is derived from a yeast, even more preferably from
Saccharomyces, Kluyveromyces or Zygosaccharomyces and most preferably from
Z. rouxii or Z. bailii. The promoter can be constitutive, inducible or repressible.
Inducible promoters can be induced by the addition to the medium of an
appropriate inducer molecule or by an appropriate change of the chemical or
physical growth environment (such as the temperature or pH value), which will be
determined by the identity of the promoter. Repressible promoters can be
repressed by the addition to the medium of an appropriate repressor molecule or
by an appropriate change of the chemical or physical growth environment (such as
the temperature or pH value), which will be determined by the identity of the
promoter. Constitutive promoters are preferred, as the use of an appropriate
repressor or inducer molecule or an appropriate change of the chemical or physical
growth environment is not required. Preferably the promoter is selected from the
list of: triose-phosphate isomerase (TPI), glyceraldehyde phosphate
dehydrogenase (GAPDH), alcohol dehydrogenase 1 (ADH1), phosphoglycerate
kinase (PGK), glyceraldehyde-3-phosphate dehydrogenase (GAP), GAL1,
GAL10, acid phosphatase (PHO5), cytochrome c-1 (CYC1), copper-binding
metallothionein (CUP1) or α-factor mating pheromone precursor (MFα1) promoter
or the hybrid promoters GAL/CYC1, such as GAL1-10/CYC1, GAP/GAL,
PGK/GAL, GAP/ADH2, GAP/PHO5 or CYC1/GRE either from S. cerevisiae, Z.
rouxii or Z. bailii, but preferably from Z. bailii. Especially preferred promoters are
the TPI promoters either from S. cerevisiae corresponding to SEQ ID No.: 78 or Z.
bailii corresponding to SEQ ID No.: 79, but particularly preferred is the TPI
promoter from Z. bailii (SEQ ID No.: 79). Further particularly preferred promoters
are the GAPDH promoters from Z. rouxii (SEQ ID No.: 92) or Z. bailii.

Furthermore the vector preferably comprises a transcriptional terminator sequence
following the coding sequence for the desired protein for efficient mRNA 3' end
formation. Such a terminator sequence is preferably derived from a yeast, more
preferably from Saccharomyces or Zygosaccharomyces, even more preferably
from S. cerevisiae or Z. bailii and most preferably from Z. bailii. A preferred
example for a terminator sequence comprises the following tripartite consensus
sequence: TAG...(T-rich)...TA(T)GT...(AT-rich)...TTT. Another preferred example
comprises the sequence motif TTTTTATA.
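
As an illustration of how the two terminator signals named above might be located in a candidate sequence, the following Python sketch uses a regular expression. It is a simplification under stated assumptions: the function name `find_terminator_signals` is hypothetical, TA(T)GT is read as TAGT or TATGT, and the T-rich and AT-rich character of the spacers is not checked, so a hit only indicates that the three elements occur in the right order.

```python
import re

# TAG, any spacer, TAGT or TATGT, any spacer, TTT (spacer composition not enforced).
TRIPARTITE = re.compile(r"TAG[ACGT]*?TAT?GT[ACGT]*?TTT")
SIMPLE_MOTIF = "TTTTTATA"

def find_terminator_signals(seq):
    """Return positions of the tripartite consensus and the TTTTTATA motif.

    The tripartite entry is a (start, end) tuple of the first match or None;
    the TTTTTATA entry is the index of the first occurrence or None.
    """
    seq = seq.upper()
    match = TRIPARTITE.search(seq)
    motif_pos = seq.find(SIMPLE_MOTIF)
    return {
        "tripartite": (match.start(), match.end()) if match else None,
        "TTTTTATA": motif_pos if motif_pos != -1 else None,
    }
```

Because the spacers are matched lazily, the reported span is the shortest arrangement starting from the first TAG; a stricter check of spacer composition would be needed before drawing conclusions about terminator function.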

Further the vector comprises a signalling sequence (= leader sequence; upon
expression translated into signal peptide or leader peptide). Such sequences
direct expressed proteins from the cytosol into the culture medium. In
other words signal sequences cause the secretion of the proteins and their
accumulation in the medium. Signal sequences generally code for a continuous
stretch of amino acids, typically 15 to 60 residues long (up to 150), which
characteristically include one or more positively charged amino acid(s) followed
by a stretch of about 5 to 10 hydrophobic amino acids, which may or may not be
interrupted by non-hydrophobic residues. Preferably the signal peptide comprises
15 to 45 amino acids, even more preferably 15 to 30 amino acids. Even though their
amino acid sequences can vary greatly, the signal peptides of all proteins having
the same destination in one organism are functionally interchangeable: physical
properties, such as hydrophobicity or the pattern of charged amino acids, often
appear to be more important in the signal-recognition process than the exact
amino acid sequence.
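
The general features of signal peptides described above can be expressed as a simple heuristic check. The following Python sketch is illustrative and not part of the disclosure: the function names, the set of residues counted as hydrophobic and the requirement that the hydrophobic core be uninterrupted are assumptions (the text itself allows interruptions by non-hydrophobic residues). The length limits of 15 to 60 residues and the minimum core of 5 residues follow the ranges given in the text.

```python
# Assumed residue classes for this illustration.
HYDROPHOBIC = set("AVLIMFWC")
POSITIVE = set("KR")

def longest_hydrophobic_run(peptide):
    """Return (start, length) of the longest uninterrupted hydrophobic run."""
    best_start, best_len = 0, 0
    run_start, run_len = 0, 0
    for i, residue in enumerate(peptide.upper()):
        if residue in HYDROPHOBIC:
            if run_len == 0:
                run_start = i
            run_len += 1
            if run_len > best_len:
                best_start, best_len = run_start, run_len
        else:
            run_len = 0
    return best_start, best_len

def looks_like_signal_peptide(peptide, min_len=15, max_len=60, min_core=5):
    """Heuristic: right length, a hydrophobic core, and a positive charge before it."""
    peptide = peptide.upper()
    if not (min_len <= len(peptide) <= max_len):
        return False
    start, length = longest_hydrophobic_run(peptide)
    if length < min_core:
        return False
    # At least one positively charged residue must precede the hydrophobic core.
    return any(residue in POSITIVE for residue in peptide[:start])
```

For example, the made-up peptide `MKRLLALLLLAVVASSAGA` passes the check, whereas a peptide of the same length without a hydrophobic core does not. Such a check says nothing about whether a peptide is actually recognised by the secretory machinery of a given yeast.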

Preferably the DNA sequence coding for the signal peptide is selected from the
list of: SEQ ID NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5, SEQ ID NO.: 7, SEQ ID
NO.: 9, SEQ ID NO.: 11, SEQ ID NO.: 13, SEQ ID NO.: 15, SEQ ID NO.: 17,
SEQ ID NO.: 19, SEQ ID NO.: 21, SEQ ID NO.: 23, SEQ ID NO.: 25, SEQ ID
NO.: 27, SEQ ID NO.: 29, SEQ ID NO.: 31, SEQ ID NO.: 33, SEQ ID NO.: 35,
SEQ ID NO.: 37, SEQ ID NO.: 39, SEQ ID NO.: 41, SEQ ID NO.: 43, SEQ ID
NO.: 45, SEQ ID NO.: 47, SEQ ID NO.: 49, SEQ ID NO.: 51, SEQ ID NO.: 53,
SEQ ID NO.: 55, SEQ ID NO.: 57, SEQ ID NO.: 59, SEQ ID NO.: 61. Even more
preferably the amino acid sequence of the signal peptide is selected from the list
of: SEQ ID NO.: 2, SEQ ID NO.: 4, SEQ ID NO.: 6, SEQ ID NO.: 8, SEQ ID
NO.: 10, SEQ ID NO.: 12, SEQ ID NO.: 14, SEQ ID NO.: 16, SEQ ID NO.: 18,
SEQ ID NO.: 20, SEQ ID NO.: 22, SEQ ID NO.: 24, SEQ ID NO.: 26, SEQ ID
NO.: 28, SEQ ID NO.: 30, SEQ ID NO.: 32, SEQ ID NO.: 34, SEQ ID NO.: 36,
SEQ ID NO.: 38, SEQ ID NO.: 40, SEQ ID NO.: 42, SEQ ID NO.: 44, SEQ ID
NO.: 46, SEQ ID NO.: 48, SEQ ID NO.: 50, SEQ ID NO.: 52, SEQ ID NO.: 54,
SEQ ID NO.: 56, SEQ ID NO.: 58, SEQ ID NO.: 60, SEQ ID NO.: 62.
Particularly preferably the DNA sequence coding for the signal peptide is selected
from the list of SEQ ID NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 21 or SEQ ID NO.:
35; correspondingly, the amino acid sequence of the signal peptide is preferably
selected from the list of SEQ ID NO.: 2, SEQ ID NO.: 4, SEQ ID NO.: 22 or SEQ
ID NO.: 36.

WO 2004/042036    PCT/EP2003/012377

The signal peptide is preferably removed from the finished protein. This can occur
through activity of a specialised signal peptidase. The signal peptidase can be of
homologous or heterologous origin. Therefore, the signal peptide preferably
comprises a processing site or a cleavage site that allows for recognition by a
specific endopeptidase.

In a preferred embodiment of the present invention the Z. bailii strain is
transformed with a vector comprising the DNA sequence coding for the protein,
functionally linked to the signalling pre-sequence (16 aa) of the alpha-subunit of
the K1 killer toxin of K. lactis (Stark M. J. et al., 1986, EMBO J. 5, 1995-2002,
SEQ ID NO.: 35 (DNA) and SEQ ID NO.: 36 (peptide)) and further functionally
linked to the TPI promoter from S. cerevisiae. More preferably the vector is pZ3kl
(Figure 1b). Even more preferably the Z. bailii strain is transformed with a vector
comprising the DNA sequence coding for the protein, functionally linked to the
signal sequence of the K1 killer toxin of K. lactis and further functionally linked
to the GAPDH promoter from Z. rouxii. Even more preferably the Z. bailii strain
is transformed with a vector comprising the DNA sequence coding for the protein,
functionally linked to the signal sequence of the K1 killer toxin of K. lactis and
further functionally linked to the TPI promoter from Z. bailii. Particularly
preferably said vector is derived from pZ3bT (Figure 4a).

In another preferred embodiment of the present invention the Z. bailii strain is
transformed with a vector comprising the DNA sequence coding for the protein,
functionally linked to the signal sequence of the pre-pro α-factor of S. cerevisiae
and further functionally linked to the TPI promoter from S. cerevisiae. Preferably
the vector is pZ3ppα (Figure 1c). Even more preferably the Z. bailii strain is
transformed with a vector comprising the DNA sequence coding for the protein,
functionally linked to the signal sequence of the pre-pro α-factor of S. cerevisiae
and further functionally linked to the GAPDH promoter from Z. rouxii. Even more
preferably the Z. bailii strain is transformed with a vector comprising the DNA
sequence coding for the protein, functionally linked to the signal sequence of the
pre-pro α-factor of S. cerevisiae and further functionally linked to the TPI
promoter from Z. bailii. Particularly preferably said vector is derived from pZ3bT
(Figure 4a).

In yet another preferred embodiment of the present invention the Z. bailii strain is
transformed with a vector comprising the DNA sequence coding for the protein,
functionally linked to the zygocin killer toxin pre-sequence of Z. bailii (SEQ ID
No.: 59) and further functionally linked to a promoter functional in Z. bailii.
Preferably said promoter is the TPI promoter from S. cerevisiae. Even more
preferably said promoter is the TPI promoter from Z. bailii. Most preferred is the
GAPDH promoter from Z. rouxii.

The DNA sequence coding for the protein can be derived from animal, bacterial, 
fungal, plant or viral sources, more preferably from metazoan, mammalian or 
fungal sources. The expressed protein might therefore be homologous or 
heterologous to Z. bailii. 

Any yeast belonging to the species Z. bailii can be used for the production of
proteins in the scope of the present invention. In a preferred embodiment of the
invention the Z. bailii strain is transformed. "Transformation" refers to a process
of introducing an exogenous nucleic acid sequence (of homologous and/or
heterologous origin, recombinant or not) into a cell in which that exogenous
nucleic acid is incorporated into a chromosome or is capable of autonomous
replication. A cell that has undergone transformation, or a descendant of such a
cell, is "transformed" or "recombinant". If the exogenous nucleic acid comprises a
coding region encoding a protein and the protein is produced in the transformed
yeast, such a transformed yeast is functionally transformed. Preferred methods to
transform Z. bailii are electroporation, as described in WO 00/41477, or the
chemical LiAc/PEG/ssDNA method as described by Agatep, R. et al., 1998,
Technical Tips Online (http://tto.trends.com).

Preferably the Z. bailii strain that is being transformed is selected from the list of:
ATCC 36947, ATCC 60483, ATCC 8766, FRR 1292, ISA 1307, NCYC 128,
NCYC 563, NCYC 1416, NCYC 1427, NCYC 1766, NRRL Y-2227, NRRL Y-2228,
NRRL Y-7239, NRRL Y-7254, NRRL Y-7255, NRRL Y-7256, NRRL Y-7257,
NRRL Y-7258, NRRL Y-7259, NRRL Y-7260, NRRL Y-7261, NRRL Y-27164;
particularly preferred are ATCC 36947, ATCC 60483, ATCC 8766 and
NCYC 1427.

(ATCC: American Type Culture Collection, Manassas VA, USA; FRR: FRR
Culture Collection, North Ryde NSW, Australia; ISA: Culture Collection of the
Instituto Superior de Agronomia, Lisbon; NCYC: National Collection of Yeast
Cultures, Norwich, UK; NRRL: Agricultural Research Service Culture Collection,
Peoria IL, USA).

Within the scope of the present invention the Z. bailii strain can be subjected to a
selection process for improved secretion. Screening for and isolation of such a
"super-secreting" phenotype can occur before or after transformation of the
respective Z. bailii strain.

In a preferred embodiment of the present invention the Z. bailii gene/s
homologous to GAS1 from S. cerevisiae are identified and disrupted. GAS1 is one
example of the few cases wherein the key molecules involved in the intriguingly
complex secretory pathway have been identified. It was possible to influence the
whole secretory mechanism by modifying the GAS1 expression level in S. cerevisiae
(Vai, M., et al., 2000, Appl. Environ. Microbiol. 66, 5477-9) due to a resultant
modification of the organisation of the cell wall structure; namely, it was
demonstrated that gas1 mutants show a "super-secreting" phenotype (Popolo L., et
al., 1997, J. Bacteriol. 180, 163-6; Ram A. F. J., et al., 1998, J. Bacteriol. 180,
1418-24).

In another preferred embodiment of the present invention the Z. bailii strain has
undergone one or more mutagenisation/selection cycle(s), comprising chemical or
physical mutagenesis, to obtain super-secreting mutants. Preferably the
mutagenisation is caused by orthovanadate. Orthovanadate is a molecule known to
affect the glycosylation process and the cell wall construction in S. cerevisiae
(Kanik-Ennulat, C., et al., 1990, Mol. Cell. Biol. 10, 898-909). Methods involving
orthovanadate mutagenisation to obtain cells with changed cell wall
construction/secretory properties that are useful in the scope of the present
invention are disclosed in more detail, for example, for S. cerevisiae (Willsky,
G.R., et al., 1985, J. Bacteriol. 164, 611-7) and K. lactis (Uccelletti, D., et al.,
1999, Res. Microbiol. 150, 5-12; Uccelletti, D., et al., 2000, Yeast 16, 1161-71).

Culturing techniques and media suitable for yeast are well known in the art.
Typically, but without limitation, culturing is performed by aqueous fermentation
in an appropriate vessel. Examples of a typical vessel for yeast fermentation
comprise a shake flask or a bioreactor.

The culture is typically performed at a temperature between 20°C and 40°C,
preferably between 25°C and 35°C and even more preferably between 28°C and
32°C.

The medium in which the Z. bailii strain is cultured can be any medium known in
the art to be suitable for this purpose. The medium might contain complex
ingredients or might be chemically defined. Chemically defined media are
preferred. The medium comprises any component required for the growth of the
yeast. In particular the medium comprises a carbon source, such as fructose,
glucose or other carbohydrates (such as sucrose, lactose, D-galactose, or
hydrolysates of vegetable matter, among others). Typically, the medium further
comprises a nitrogen source, either organic or inorganic, and optionally the
medium may also comprise macro nutrients and/or micro nutrients such as amino
acids; purines; pyrimidines; corn steep liquor; yeast extract; protein hydrolysates,
such as peptone; vitamins (water-soluble and/or water-insoluble), such as B
complex vitamins; or inorganic salts such as chlorides, hydrochlorides,
phosphates, or sulphates of Ca, Mg, Na, K, Fe, Ni, Co, Cu, Mn, Mo, or Zn, among
others. Antifoam might be added, if necessary. Further components known to one
of ordinary skill in the art to be useful in yeast culturing or fermentation can also
be included. The medium may or may not be buffered. A preferred medium
comprises yeast extract, peptone and glucose (=YPD). A more preferred medium
comprises yeast extract, peptone and fructose (=YPF). An even more preferred
medium comprises glucose and Yeast Nitrogen Base (YNB, Difco Laboratories,
Detroit, MI #919-15). Another even more preferred medium comprises fructose
and YNB.

Particularly preferred is a medium comprising high fructose corn syrup as carbon
source (for example Isosweet® 100 42% High Fructose (80% solids) or
Isosweet® 5500 55% Fructose from Tate & Lyle PLC or IsoClear® 42% High
Fructose Corn Syrup or IsoClear® 55% High Fructose Corn Syrup from Cargill,
Inc.).

The compositions of preferred media for batch/fed-batch cultivation of Z. bailii
according to the instant invention are as follows: the batch phase medium
comprises 4% w/V glucose, 0.5% w/V (NH4)2SO4, 0.05% w/V MgSO4, 0.3%
w/V KH2PO4, vitamins according to Verduyn, C., et al., 1992, Yeast 8, 501-17,
wherein the final concentration of vitamins will be 3 times the
indicated concentrations, and trace elements according to Verduyn, C., et al., 1992,
Yeast 8, 501-17, wherein the final concentration will also be 3 times
the indicated concentrations. The pH control (value: pH 5) is performed by the
addition of 2 M KOH. The fed-batch medium comprises 50% w/V glucose,
15.708 g/l KH2PO4, 5 g/l KCl, 5.831 g/l MgSO4, 1.2 g/l CaCl2, 1 g/l yeast
extract, 0.4447 g/l NaCl, 1 g/l glutamate, 0.05 g/l ZnSO4, 0.04 g/l CuSO4, 0.05
g/l MnCl2, 0.001 g/l CoCl2, 0.5 g/l myo-inositol, 0.1 g/l thiamine hydrochloride,
0.02 g/l pyridoxol hydrochloride, 0.04 g/l Ca-D(+)-pantothenate, 0.004 g/l d-biotin,
0.09 g/l nicotinic acid. The pH control (value: pH 5) is performed by the addition
of 2 M NH4OH.

In case of selection for the dominant G418 marker, 200 mg/l G418 is added to the
respective medium.
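
As an illustration only, the fed-batch composition given above can be scaled to a working volume with a few lines of Python. This is a minimal sketch and not part of the disclosed method: the function names and the 2 l example volume are hypothetical, the conversion assumes 1% w/V = 10 g/l, and the salts are taken exactly as named in the text (hydrate forms are not specified there, so none are assumed).

```python
# Fed-batch feed components in g/l, copied from the composition listed above.
# 50% w/V glucose is entered as 500 g/l (assumption: 1% w/V = 10 g/l).
FED_BATCH_G_PER_L = {
    "glucose": 500.0,
    "KH2PO4": 15.708,
    "KCl": 5.0,
    "MgSO4": 5.831,
    "CaCl2": 1.2,
    "yeast extract": 1.0,
    "NaCl": 0.4447,
    "glutamate": 1.0,
    "ZnSO4": 0.05,
    "CuSO4": 0.04,
    "MnCl2": 0.05,
    "CoCl2": 0.001,
    "myo-inositol": 0.5,
    "thiamine hydrochloride": 0.1,
    "pyridoxol hydrochloride": 0.02,
    "Ca-D(+)-pantothenate": 0.04,
    "d-biotin": 0.004,
    "nicotinic acid": 0.09,
}

# Added only when selecting for the dominant G418 marker.
G418_MG_PER_L = 200.0


def percent_wv_to_g_per_l(percent_wv):
    """Convert a % w/V value (g per 100 ml) to g/l."""
    return percent_wv * 10.0


def masses_for_volume(recipe_g_per_l, volume_l):
    """Return the grams of each component needed for volume_l litres."""
    if volume_l <= 0:
        raise ValueError("volume must be positive")
    return {name: conc * volume_l for name, conc in recipe_g_per_l.items()}


if __name__ == "__main__":
    volume_l = 2.0  # hypothetical feed volume, chosen only for the example
    for name, grams in masses_for_volume(FED_BATCH_G_PER_L, volume_l).items():
        print(f"{name:>24}: {grams:10.4f} g")
    print(f"{'G418':>24}: {G418_MG_PER_L * volume_l:10.1f} mg")
```

Keeping the recipe as a plain mapping from component name to g/l makes it easy to check each entry against the text before weighing anything out.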

The use of a defined medium, the components of which are adjusted to the needs
of the organism, is preferred. The purification of the protein is thereby significantly
simplified.

Preferably, the pH of the culture medium ranges between 2 and 9, more preferably
between 3 and 8 and even more preferably between 4 and 7. The pH can be
regulated, partially regulated or not regulated during the course of
fermentation; accordingly the pH can be kept constant at a preferred value or can
change during fermentation. A significant advantage of Z. bailii is its surprising
capacity to grow as well as express and secrete proteins at low pH. Therefore,
this organism has no pronounced need for a strictly controlled pH.

The cultivation can take place in batch, fed-batch or continuous mode as is known 
to the ordinary skilled artisan. 

During the course of the fermentation, the desired protein is expressed, properly
processed (i.e. folded, modified, cut, etc.) and secreted (accumulated in the
medium). While the protein produced may be partially retained within the yeast
cells it is preferred that a substantial amount of the protein is secreted. Even more
preferred is that the protein is entirely secreted.

After culturing has progressed for a sufficient length of time to produce a desired
concentration of the protein in the yeast and/or the culture medium, the protein is
isolated. "Isolated," as used herein to refer to the protein, means being brought to
a state of greater purity by separation of the protein from at least one other
component of the yeast or the medium. Preferably, the isolated protein is at least
about 80% pure by weight, more preferably at least about 90% pure by weight
and even more preferably at least about 95% pure by weight. Evidence of purity
can be obtained by SDS-PAGE, 2D electrophoresis, IF, HPLC, mass spectrometry,
capillary electrophoresis or other methods well known in the art.

"Purity" refers to the absence of contaminants in the final purified protein. Typical
contaminants to be separated from the desired product are proteins, pyrogens and
nucleic acids, among others.

The protein is isolated from the culture medium, preferably without lysing of the
cells. Such an isolation comprises purifying the protein from the medium.
Purification can be achieved by techniques well known in the art, such as filtration
(e.g. microfiltration, ultrafiltration, nanofiltration), crystallisation or precipitation,
centrifugation, extraction, chromatography (e.g. ion exchange, affinity,
hydrophobic exchange), among others.

Upon removal of the cells, the culture broth might also directly serve as the
product (e.g. enzyme solution), without further purification. The medium
components can be adjusted appropriately prior to the cultivation.

If the protein is not completely secreted, the protein can also be isolated from both
the yeast cells and the medium. Methods for lysing of the yeast cells are known in
the art and comprise chemical or enzymatic treatment, treatment with glass beads,
sonication, freeze/thaw cycling, or other known techniques. The protein can be
purified from the various fractions of the yeast lysate by appropriate techniques,
such as filtration (e.g. microfiltration, ultrafiltration, nanofiltration), crystallisation
or precipitation, centrifugation, extraction, chromatography (e.g. ion exchange,
affinity, hydrophobic exchange), among others.

Another embodiment of the present invention relates to a Z. bailii strain
expressing and secreting a heterologous protein.

The Z. bailii strain might be transformed with a vector comprising a DNA
sequence coding for the heterologous protein, functionally linked to a signal
sequence leading to the secretion of the protein and further functionally linked to a
promoter.

Description of the Figures: 

Figure 1: Expression and Secretion Vectors

Schematic maps of the plasmids constructed for expression of proteins in Z. bailii:
a: pZ3 (intracellular expression), b: pZ3kl (expression and secretion) and c:
pZ3ppα (expression and secretion).

a) pZ3: the backbone of the plasmid is the pYX022 S. cerevisiae expression
plasmid (R&D Systems, Inc., Wiesbaden, D); the expression cassette is based on
the constitutive S. cerevisiae TPI promoter and the corresponding polyA signal, as
indicated in the Figure. The ARS/CEN fragment, from YCplac33 (Gietz, R. D., et
al., 1988, Gene 74, 527-34), ensures replication and stability of the plasmid, while
the KanR cassette, derived from pFA6-KanMX4 (Wach, et al., 1994, Yeast 10,
1793-808), allows a G418-based selection of the transformants.

b) pZ3kl: a pZ3 expression vector comprising the signal sequence of the K. lactis
K1 killer toxin (kl) for directing the secretion of the protein of interest.

c) pZ3ppα: a pZ3 expression vector comprising the pre-pro leader sequence of the
S. cerevisiae pheromone α-factor (pre-pro-αF) for directing the secretion of the
protein of interest.

(Amp = ampicillin resistance cassette; MCS = multi cloning site; colE1 ori = E. coli
replication origin)

Figure 2: Expression and Secretion Vectors

Schematic maps of the plasmids constructed for expression and secretion of the
human IL-1β (Auron, E., et al., 1984, PNAS 81, 7907-11) and the GFP (Heim, R.
et al., 1996, Curr. Biol. 6, 178-82) in Z. bailii.

a) pZ3klIL-1β: a pZ3kl vector where the sequence encoding the human IL-1β
was sub-cloned into the MCS.

b) pZ3ppαIL-1β: a pZ3ppα vector where the sequence encoding the human
IL-1β was sub-cloned into the MCS.

c) pZ3ppαGFP: a pZ3ppα vector where the sequence encoding the GFP was
sub-cloned into the MCS.



Figure 3: Expression and Secretion Vectors

Schematic maps of the plasmids constructed for the expression of the Arxula
adeninivorans glucoamylase (GAA, GenBank accession no.: Z46901, Bui Minh,
D., et al., 1996, Appl. Microbiol. Biotechnol. 44, 610-9) and of the bacterial
β-galactosidase (from the plasmid pSV-β-galactosidase of Promega, Inc.; GenBank
accession no.: X65335) in Z. bailii.

a) pZ3GAA: a pZ3 vector where the sequence encoding the glucoamylase
(GAA) was sub-cloned into the MCS.

b) pZ3LacZ: a pZ3 vector where the sequence encoding the β-galactosidase
was sub-cloned into the MCS.

Figure 4: Expression Vectors

Schematic maps of the plasmids constructed for the expression of proteins in Z.
bailii based on the Z. bailii TPI promoter.

a) pZ3bT: a pZ3 vector where the S. cerevisiae TPI promoter was substituted by
the Z. bailii TPI promoter.

b) pZ3bTLacZ: a pZ3bT expression vector where the sequence encoding the
β-galactosidase was sub-cloned into the MCS.

Figure 5: IL-1β secretion
a) Growth kinetics in minimal (YNB) and rich (YPD) medium, with glucose 5%
(w/V) as a carbon source: the cellular growth was measured as optical density (OD
660nm, circles) and the residual glucose (g/l, squares) was evaluated. Comparison
between S. cerevisiae (open symbols) and Z. bailii (full symbols).

b) Western Blot analyses performed on cellular extracts of S. cerevisiae and Z.
bailii cells transformed with the plasmid pZ3klIL-1β (expressing IL-1β preceded
by the leader sequence from the K. lactis killer toxin) and with the corresponding
empty plasmid (pZ3), as a negative control. In the first lane a positive control
(IL-1β, human recombinant (E. coli), Roche cat n° 1 457 756) was loaded. Samples
were collected at the indicated times and from the indicated media, corresponding
to the kinetics shown in (a). The loaded volumes were adjusted to a
corresponding OD value of 0.08. The blotted membranes were probed with an
α-IL-1β polyclonal antibody.

c) as above, where the loaded samples represent the corresponding supernatant.

d) as above, where the samples were loaded with an equal volume of medium
(30 µl).

Figure 6: Secretion of IL-1β and of GFP in Z. bailii directed by the
pre-pro-α-factor signal sequence

a) Western Blot analyses performed on cellular extracts (i) and on supernatants (ii)
of Z. bailii and S. cerevisiae cells transformed with the plasmid pZ3ppαIL-1β (and
with the corresponding empty plasmid pZ3) and growing on YPD medium
(glucose 2% w/V). Samples were taken at the indicated times. First lane: positive
control (IL-1β, human recombinant (E. coli), Roche cat n° 1 457 756). The blotted
membranes were probed with an α-IL-1β polyclonal antibody.
Western Blot analyses performed on cellular extracts (iii) and on supernatants (iv)
of Z. bailii and S. cerevisiae cells transformed with the plasmid pZ3ppαIL-1β (and
with the corresponding empty plasmid pZ3) and growing on YNB medium
(glucose 5% w/V). Samples were taken at the indicated times. First lane: positive
control (IL-1β, human recombinant (E. coli), Roche cat n° 1 457 756). The blotted
membranes were probed with an α-IL-1β polyclonal antibody.

b) Western Blot analyses performed on cellular extracts (cells) and on
supernatants (sup) of Z. bailii cells growing on YNB medium (glucose 2% w/V)
transformed with the control plasmid pZ3 (1st and 2nd lanes) and with the plasmid
pZ3ppαGFP (3rd and 4th lanes). The blotted membrane was probed with an α-GFP
polyclonal antibody. An arrow indicates the expected positive signal.

Figure 7: Batch cultivations of Z. bailii cells comprising the pZ3klIL-1β
expression plasmid on chemically defined medium in high sugar concentration

a) Culture OD (full circles), dry weight (open circles), glucose consumption (full
squares) and ethanol production (open triangles).

b) Western Blot analyses performed on the growth medium (lanes 2 to 5) and on
the cell extracts (lanes 6 to 9) of Z. bailii cells. Samples were collected at the
indicated times of the kinetics, and an equal volume (30 µl for the supernatants and
15 µl for the cell extracts, respectively) was loaded. The blotted membranes were
probed with an α-IL-1β polyclonal antibody.

First lane: positive control (IL-1β, human recombinant (E. coli), Roche cat n° 1 457
756).

Figure 8: Enzymatic activity of heterologous enzymes expressed in Z. bailii cells

a) Determination of the A. adeninivorans glucoamylase activity (mU/OD) present
in the growth medium (YNB, glucose 2% w/V) of Z. bailii cells transformed with
the plasmid pZ3GAA (and the respective empty plasmid pZ3, as a control). Three
independent clones were analysed (Cl. 1, Cl. 3 and Cl. 5).

b) Determination of the β-galactosidase activity (Miller U/OD) in cell extracts of
Z. bailii cells transformed with the plasmid pZ3LacZ (two independent clones) and
with the plasmid pZ3bTLacZ (three independent clones), and the respective empty
plasmid pZ3 as a control. Cells were grown in YPD medium (glucose 2% w/V),
and samples were collected at the indicated times.

The left panel shows the Z. bailii strain ATCC 36947 and the right panel the
Z. bailii strain ATCC 60483.

Figure 9: Construction of a Z. bailii multicopy plasmid
Schematic maps of the endogenous plasmids isolated from Z. bailii ATCC 36947,
named pZB1 (a) and from Z. bailii NCYC 1427, named pZB5 (b).

c) Z. bailii multicopy expression vector comprising the genes and the sequences
necessary and sufficient for a stable and autonomous high copy number
replication. The expression cassette is based on the Z. bailii constitutive TPI
promoter and the polyA, as indicated in the Figure. The marker for selection is the
KanR cassette.

d) Z. bailii multicopy expression vector. The expression cassette is based on the
Z. bailii constitutive TPI promoter and the polyA, as indicated in the Figure.
Furthermore, the vector comprises the IR/ARS region and the TFC/FLP genes
including their homologous promoters as indicated.

Figure 10: Influence of the promoter or the plasmid constituents, respectively, on
β-galactosidase activity.

Shown is the relative β-galactosidase activity in cell extracts of Z. bailii ATCC
36947 cells transformed with the indicated plasmids. The β-galactosidase activity
of cells transformed with pZ3LacZ was set to 1 and the other activities were
related to that value. Cells were grown in YPD medium (glucose 2% w/V), and
samples were collected as the cultures reached an OD 660 value between 1 and 2.
a) Different promoters in the same plasmid. pZ3: ScTPI, pZ3bT: ZbTPI, pZ3rG:
ZrGAPDH.

b) Different plasmid constituents. pZ3: Sc ARS/CEN, p195: Sc 2µm ori sequence,
pEZ-IA: Zb 2µm ori sequence (IR-A), pEZ-IAF: Zb 2µm ori sequence (IR-A) +
FLP, pEZ2: Zb 2µm ori sequence (IR-A) + FLP + TFC, pEZ2-IB: Zb 2µm ori
sequence (IR-A) + FLP + TFC + IR-B. The table indicates the determined plasmid
stability of the respective constructs.
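
The normalisation used for Figure 10 (activity of the reference construct set to 1, all other activities related to it) amounts to a simple division. The sketch below only illustrates that arithmetic; the function name is hypothetical and the activity values in the usage part are invented placeholders, not measurements from the disclosure.

```python
def relative_activities(activities, reference):
    """Express each activity as a multiple of the reference construct's activity."""
    ref_value = activities[reference]
    if ref_value <= 0:
        raise ValueError("reference activity must be positive")
    return {name: value / ref_value for name, value in activities.items()}


if __name__ == "__main__":
    # Placeholder numbers in arbitrary units, chosen only to show the call.
    measured = {"pZ3LacZ": 2.0, "construct A": 5.0, "construct B": 1.0}
    for name, rel in relative_activities(measured, "pZ3LacZ").items():
        print(f"{name}: {rel:.2f}")
```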

Figure 11: Secretion of IL-1β directed by the zygocin pre-sequence and
comparison of different leader sequences

a) Western Blot analyses performed on cellular extracts (i) and on supernatants (ii)
of Z. bailii and S. cerevisiae cells transformed with the plasmid pZ3kbIL-1β (and
with the corresponding empty plasmid pZ3) and growing on YPD medium
(glucose 2% w/V). Samples were taken at the indicated times. First lane: positive
control (IL-1β, human recombinant (E. coli), Roche cat n° 1 457 756). The blotted
membranes were probed with an α-IL-1β polyclonal antibody.
Western Blot analyses performed on cellular extracts (iii) and on supernatants (iv)
of Z. bailii and S. cerevisiae cells transformed with the plasmid pZ3kbIL-1β (and
with the corresponding empty plasmid pZ3) and growing on YNB medium
(glucose 5% w/V). Samples were taken at the indicated times. The blotted
membranes were probed with an α-IL-1β polyclonal antibody.

b) Western Blot analyses performed on supernatants of Z. bailii cells growing on
YNB medium (glucose 2% w/V) transformed with the indicated plasmids. The
blotted membranes were probed with an α-IL-1β polyclonal antibody.

Figure 12: Glucoamylase Sta2 activity in transformed Z. bailii or S. cerevisiae
cells, respectively

Determination of the S. cerevisiae var. diastaticus glucoamylase Sta2 activity
(U/OD) in the growth medium (YNB, fructose 2% w/V) of Z. bailii and S.
cerevisiae cells transformed with the plasmids pZ3STA2 and pZ3klSTA2 and the
respective empty plasmid pZ3, as a control (as indicated). In the first plasmid the
protein is directed to secretion by its own leader sequence, in the second by the
K. lactis killer toxin pre-leader sequence. Measurements were repeated several
times and on independent clones, and variation levels are indicated with error bars.



Examples: 

The following examples are included to demonstrate preferred embodiments of
the invention. It should be appreciated by those of skill in the art that the
techniques disclosed in the examples which follow represent techniques
discovered by the inventors to function well in the practice of the instant
invention, and thus can be considered to constitute preferred modes for its
practice. However, those of skill in the art should, in light of the present
disclosure, appreciate that many changes can be made in the specific embodiments
which are disclosed and still obtain a like or similar result without departing from
the spirit and scope of the invention.



Example 1: Construction of Z. bailii expression plasmids
The backbone of the new vector pZ3 (Fig. 1a) is the basic S. cerevisiae expression
plasmid pYX022 (R&D Systems, Inc., Wiesbaden, D).

The ARS1-CEN4 fragment was taken from YCplac33 (ATCC 87623, GenBank
accession no.: X75456 L26352). It was cut ClaI-blunt/SpeI and cloned into
pYX022 opened DraIII-blunt/SpeI (in this way the plasmid completely lost the
HIS gene).
The plasmid obtained was opened KpnI-blunt, and here the Kan cassette, derived
from pFA6-KanMX4 (Wach et al., 1994, Yeast 10, 1793-1808), was inserted. The
respective fragment was excised by cutting with SphI/SacI-blunt. This kanMX
module contains the known kanr open reading frame of the E. coli transposon
Tn903 fused to transcriptional and translational control sequences of the TEF gene
of the filamentous fungus Ashbya gossypii (e.g. NRRL Y-1056). The described
hybrid module permits efficient selection of transformants resistant to
geneticin (G418).

The expression cassette based on the constitutive S. cerevisiae TPI promoter and
the respective polyA, interspaced by the multi cloning site (MCS), as indicated in
the Figure, derives from the original pYX022 plasmid (see supplier's information).
All the other plasmids indicated in Figures 1 to 4 derive from pZ3.
For the construction of the plasmid pZ3kl (Fig. 1b), the signalling pre-sequence
(16 aa) of the alpha-subunit of the K1 killer toxin of K. lactis (Stark M.J. et al.,
1986, EMBO J. 5, 1995-2002) was functionally linked to the TPI promoter of the
pZ3 plasmid, in order to direct the secretion of the protein of interest.
For the construction of the plasmid pZ3ppα (Fig. 1c), the pre-pro-α-factor signal
sequence was similarly utilised and functionally inserted. The sequence was taken
from the plasmid pPICZαA (Invitrogen BV, The Netherlands).

For the construction of the plasmid pZ3klIL-1β (Fig. 2a), the coding sequence for
the protein already fused with the K. lactis killer toxin signal sequence was
excised by cutting XbaI/EcoRI blunt-ended from the plasmid pCXJ-kan1 (Fleer R., et al.,
1991, Gene 107, 285-95) and sub-cloned into the plasmid pZ3 opened EcoRI,
blunt-ended and de-phosphorylated.

For the construction of the plasmid pZ3ppαGFP (Fig. 2c), the fragment containing
the α-factor pre-pro leader sequence in frame with the GFP coding sequence was
cut HindIII blunt-ended/BamHI from the plasmid pPICAGFP1 and sub-cloned
in the plasmid pZ3 opened EcoRI blunt-ended/BamHI and de-phosphorylated. The
plasmid pPICAGFP1 was constructed according to Passolunghi, S., et al. by
introduction of a PCR amplified GFP sequence in frame into the plasmid
pPICZαA (Invitrogen BV, The Netherlands). The PCR technique is known in the
art. Exemplary reference is made to Gelfand, D. H., et al., PCR Protocols: A
Guide to Methods and Applications, 1990, Academic Press and Dieffenbach, C.
W. et al., PCR Primer: A Laboratory Manual, Cold Spring Harbor Laboratory
Press, 1995.

For the construction of the plasmid pZ3ppαIL-1β (Fig. 2b), the IL-1β was PCR
amplified from the plasmid pZ3klIL-1β.

The oligos for the amplification are the following:
Primer: DrdI-IL (SEQ ID NO.: 80)
5' AAGAGACTCCAACGTCGCGCACCTGTA 3' Tm: 63°C

Primer: IL C-term (SEQ ID NO.: 81)
5' AGAGGATTAGGAAGACACAAATTGCATGGTGA 3' Tm: 61°C
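
For readers who want to sanity-check the two oligonucleotides before ordering them, basic properties such as length, GC fraction and reverse complement can be computed directly from the sequences. The following is a minimal sketch: the helper names are hypothetical, and it deliberately does not attempt to reproduce the Tm values stated above, since the text does not say which Tm formula was used.

```python
# Primer sequences as printed above (5' to 3').
PRIMERS = {
    "SEQ ID NO.: 80": "AAGAGACTCCAACGTCGCGCACCTGTA",
    "SEQ ID NO.: 81": "AGAGGATTAGGAAGACACAAATTGCATGGTGA",
}

_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def validate(seq):
    """Raise ValueError if the sequence contains anything other than A, C, G, T."""
    bad = set(seq) - set(_COMPLEMENT)
    if bad:
        raise ValueError(f"unexpected characters: {sorted(bad)}")
    return seq


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    validate(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq):
    """Reverse complement, read 5' to 3'."""
    validate(seq)
    return "".join(_COMPLEMENT[base] for base in reversed(seq))


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, len(seq), "nt", f"GC={gc_fraction(seq):.2f}",
              reverse_complement(seq))
```

The validation step is the useful part for OCR-derived sequences: any character outside A, C, G and T is rejected rather than silently counted.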

The following program was used for the amplification:

94°C   5 min

94°C   45 s    ]
27°C   45 s    } 10 cycles
72°C   2 min   ]

94°C   45 s    ]
50°C   45 s    } 20 cycles
72°C   2 min   ]

72°C   7 min

4°C    ∞
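
The two-stage program above can be written down as data, which makes it easy to check the programmed hold time. This is a minimal sketch with hypothetical class and function names; it sums only the programmed step durations, ignores ramp times and the final 4°C hold, and takes the temperatures and times exactly as printed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float   # block temperature in °C
    seconds: int    # programmed hold time


@dataclass(frozen=True)
class Stage:
    steps: tuple    # Steps run in order
    cycles: int = 1 # number of times the steps are repeated


# Program as printed above; the final 4°C hold is open-ended and left out.
PROGRAM = (
    Stage((Step(94, 300),)),
    Stage((Step(94, 45), Step(27, 45), Step(72, 120)), cycles=10),
    Stage((Step(94, 45), Step(50, 45), Step(72, 120)), cycles=20),
    Stage((Step(72, 420),)),
)


def programmed_seconds(program):
    """Total programmed hold time in seconds (ramp times not included)."""
    return sum(
        stage.cycles * sum(step.seconds for step in stage.steps)
        for stage in program
    )


if __name__ == "__main__":
    total = programmed_seconds(PROGRAM)
    print(f"{total} s = {total / 60:.0f} min of programmed holds")
```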

In this way a DrdI cutting site for sub-cloning the coding sequence of the IL-1β
protein in frame with the α-factor pre-pro leader sequence was introduced. The
plasmid pZ3ppαGFP was opened EcoRI blunt-ended/BamHI. The PCR fragment
was cut DrdI blunt-ended/BamHI. Combination resulted in the plasmid
pZ3ppαIL-1β.

In the plasmid pZ3kbIL-1β, the coding sequence of the interleukin was
functionally linked to the deduced pre-leader sequence of the Z. bailii killer toxin
zygocin (GenBank accession no.: AF515592; Weiler F. et al., 2002, Mol.
Microbiol. 46, 1095-105). Essentially, oligonucleotides were synthesized
corresponding to the deduced pre-leader sequence of the Z. bailii killer toxin
zygocin (SEQ ID No.: 59) and cloned into the plasmid pZ3. Subsequently, the
IL-1β was PCR amplified as described before and cloned in-frame to the zygocin
pre-sequence.

For the construction of the plasmid pZ3GAA (Fig. 3a), the coding sequence of the
A. adeninivorans α-glucoamylase was cut BamHI blunt-ended from the plasmid
pTS32x-GAA (Bui D. M., et al., 1996, Appl. Microbiol. Biotechnol. 45, 102-6)
and inserted in the plasmid pZ3 opened EcoRI blunt-ended and de-phosphorylated.
For the construction of the plasmid pZ3STA2, the coding sequence of the S.
cerevisiae var. diastaticus amylase (comprising its own leader sequence) was cut
XbaI/AseI-blunt from the plasmid pMV35 (Vanoni M. et al., 1989, Biochim.
Biophys. Acta 1008, 168-76) and inserted in the plasmid pZ3 opened EcoRI-blunt.
For the construction of the plasmid pZ3klSTA2, the coding sequence of the same
amylase but functionally linked to the K. lactis killer toxin leader sequence was
cut XhoI/AseI-blunt from the plasmid pMV57 (Venturini M. et al., 1997, Mol.
Microbiol. 23, 997-1007) and inserted in the plasmid pZ3 opened EcoRI-blunt.

For the construction of the plasmid pZ3LacZ (Fig. 3b), the coding sequence of the
bacterial β-galactosidase was cut HindIII blunt-ended/BamHI from the plasmid
pSV-β-galactosidase (Promega, Inc.) and inserted into the plasmid pZ3 opened
EcoRI blunt-ended/BamHI and de-phosphorylated.

In the plasmid pZ3bT (Fig. 4a), the TPI promoter of S. cerevisiae was substituted
with the endogenous TPI promoter from Z. bailii. The sequence was PCR
amplified from the genomic DNA of the Z. bailii strain ISA 1307, and the primers
were designed according to the literature (Merico A., et al., 2001, Yeast 18,
775-80). Extraction of genomic DNA was performed according to the protocol
published by Hoffman, C. S., et al. (1987, Gene 57, 267-72).
The oligos for the amplification are the following:
TPIprob5 (SEQ ID NO.: 82)
5' ATCGTATTGCTTCCATTCTTCTTTTGTTA 3' Tm: 59.6°C

TPIprob3 (SEQ ID NO.: 83)
5' TTTGTTATTTGTTATACCGATGTAGTCTC 3' Tm: 59.6°C

The following program was used for the amplification:

94°C   5 min

94°C   45 s         ]
57°C   45 s         } 25 cycles
72°C   1 min 30 s   ]

72°C   7 min

4°C    ∞


The PCR fragment was sub-cloned into the vector pST-Blue1 (Novagen, Perfect
Blunt cloning Kit cat. no. 70191-4), according to the included protocol.
Therefrom, the promoter was cut SnaBI/SacI and sub-cloned into the pZ3 opened
AatII blunt-ended/SacI (so as to remove the S. cerevisiae TPI promoter), obtaining
the desired plasmid.

For the construction of the plasmid pZ3bTLacZ (Fig. 4b), the coding sequence of
the bacterial β-galactosidase was cut HindIII/BamHI blunt-ended from the
plasmid pSV-β-galactosidase (Promega, Inc.; GenBank accession no.: X65335)
and inserted into the plasmid pZ3bT opened NheI blunt-ended and
de-phosphorylated.



In the plasmid pZ3rG, the TPI promoter of S. cerevisiae was substituted with the
GAPDH promoter from Z. rouxii. The sequence was PCR amplified from
genomic DNA of the Z. rouxii strain LST11, and the primers were designed
according to the literature (Ogawa Y. et al., 1990, Agric. Biol. Chem. 54, 2521-9).
Extraction of genomic DNA was performed according to the protocol previously
mentioned. (Another possible strain is Z. rouxii NRRL Y-229.)
The oligos for the amplification are the following:

pZrGAPDH_fwd (SEQ ID NO.: 93)
5' TGCAGAAAGCCCTAAGATGCT 3' Tm: 60.3°C

pZrGAPDH_rev (SEQ ID NO.: 94)
5' XGXCTGXGAXGXACXXXXXAXXXGAXATG 3' Tm: 59.2°C

The following program was used for the amplification:

94°C   5 min

94°C   15 s   ]
57°C   30 s   } 25 cycles
72°C   45 s   ]

72°C   7 min

4°C    ∞

The obtained PCR fragment (708 bp) was sub-cloned into the vector pST-Blue1
(Novagen, Perfect Blunt cloning Kit cat. no. 70191-4), according to the included
protocol. Therefrom, the promoter was cut SnaBI/SacI and sub-cloned into the pZ3
opened AatII blunt-ended/SacI (so as to remove the S. cerevisiae TPI promoter),
obtaining the desired plasmid.

For the construction of the plasmid pZ3rGLacZ (Fig. 4b), the coding sequence of
the bacterial β-galactosidase was cut HindIII/BamHI blunt-ended from the plasmid
pSV-β-galactosidase (Promega, Inc.; GenBank accession no.: X65335) and
inserted into the plasmid pZ3rG opened XhoI blunt-ended and de-phosphorylated.

DNA manipulation, transformation and cultivation of E. coli (DH5α) were
performed following standard protocols (Sambrook J., et al., Molecular Cloning:
A Laboratory Manual, 2nd edn., Cold Spring Harbor Laboratory, New York,
1989). Other basic molecular biology protocols were also performed following
this manual if not otherwise stated. All the restriction and modification enzymes
utilised are from NEB (New England Biolabs, UK) or from Roche Diagnostics.



Example 2: Transformation of Z. bailii

Transformations of all the Z. bailii and the S. cerevisiae (NRRL Y-30320) strains
were performed basically according to the LiAc/PEG/ss-DNA protocol (Agatep,
R., et al., 1998, Transformation of Saccharomyces cerevisiae by the lithium
acetate/single-stranded carrier DNA/polyethylene glycol (LiAc/ss-DNA/PEG)
protocol. Technical Tips Online (http://tto.trends.com)). After the transformation,
Z. bailii cells were recovered by incubation for 16 hours in YP medium
comprising 2% w/V of fructose as carbon source (YPF) and 1 M sorbitol, at
30°C. The cell suspension was then plated on selective YPF plates with 200 mg/l
G418 (Gibco BRL, cat. 11811-031). Single clones appeared after 2-3 days at
30°C. From then on the transformants were grown either in rich or in minimal
medium having glucose as carbon source and 200 mg/l G418 for maintenance of
the selection. For S. cerevisiae cells, the procedure was the same, except for the
carbon source, which remained glucose in all the steps, and for the G418
concentration, which was optimised for our strain to 500 mg/l.

Example 3: Expression and secretion of Interleukin 1-β in Z. bailii
In order to check the secretory capability of the yeast Z. bailii and to compare it
with the well known host S. cerevisiae, both yeasts were transformed (according
to Example 2) with the plasmid pZ3klIL-1β (Fig. 2a). Independent transformants
were shake flask cultured in minimal medium (YNB, 1.34% w/V YNB from
Difco Laboratories, Detroit, MI #919-15, 5% w/V Glucose, complemented with
Histidine, Uracil and Leucine, Fig. 5a, left panel) or in rich medium (YPD, 5%
w/V Glucose, 2% w/V Peptone, 1% w/V Yeast extract, Fig. 5a, right panel). Fig.
5a shows the cell density (OD 660nm) and the glucose consumption during the
kinetics of growth. The glucose consumption was determined using a
commercially available enzymatic kit from Boehringer Mannheim GmbH,
Germany (Cat # 716251), according to the manufacturer's instructions. During the
kinetics, samples were collected at the indicated times (see "hours" of Fig. 5b, c,
d). Cells were harvested (a culture volume corresponding to 10^8 cells) by
centrifugation (10 min, 10,000 rpm). 1 volume of 2X Laemmli Buffer (Laemmli,
U.K., 1970, Nature 227, 680-5) was added to the supernatants of said samples;
they were boiled for 3-5 minutes and either stored at -20°C until loading or loaded
directly on a polyacrylamide gel.

The cell pellets of said samples were resuspended in 5 ml 20% TCA, centrifuged
(10 min at 3000 rpm), and the resulting pellets were resuspended in 150 µl 5%
TCA. Samples were subsequently centrifuged for 10 min at 3000 rpm, and the
pellet was resuspended in Laemmli buffer (100 µl). In order to neutralise the
samples, 1 M Tris base was added (50 µl). After 3-5 min at 99°C the samples were
ready to be loaded on a polyacrylamide gel (alternatively, they can be stored at
-20°C).

Samples were loaded on standard polyacrylamide gels (SDS-PAGE, final
concentration of the separating gel: 15%); after protein separation, gels were
blotted (1 h, 250 mA) onto nitrocellulose membranes (Protran BA 85, Schleicher &
Schuell). Immunodecoration: after 1 h (RT) of saturation in TBS 1X (1.2 g/l Tris
base; 9 g/l NaCl) + 5% NFM (non-fat milk), 0.2% Tween-20, the membranes were
incubated overnight at 4°C with the primary antibody against interleukin (rabbit
polyclonal antibody IL-1β (H-153) from Santa Cruz Biotechnology, Inc., cat. n°
sc-7884) diluted 1:200 in TBS 1X (1.2 g/l Tris base; 9 g/l NaCl) + 5% NFM. After
intensive and repeated washes in TBS + 0.2% Tween-20, the secondary antibody
(anti-rabbit Ig, horseradish peroxidase-conjugated, Amersham Biosciences, UK,
cat. n° NA934) was added (1:10,000 in TBS 1X + 5% NFM) and left to incubate for
1 h (RT). The proteins were visualised using the ECL Western Blotting System
(Amersham Biosciences, UK) according to the manufacturer's instructions.
The data obtained by Western Blot performed on the supernatant highlight the
surprisingly good secretory capability of Z. bailii cells (see Fig. 5c), both in
minimal and in rich medium. Remarkably, the signal corresponding to the secreted
protein is significantly more intense than the signal obtained from S. cerevisiae
cells, in agreement with the lower signal revealed in Z. bailii crude cell
extracts (Fig. 5b). Moreover, the difference in the levels of secreted protein is
even more pronounced in minimal medium than in rich medium (for a
comparison: Fig. 5c, left and right panel). These conclusions can be drawn
whether the samples are loaded normalised to the OD (Fig. 5c) or as equal
volumes (Fig. 5d).

Similarly, Z. bailii and S. cerevisiae cells were transformed with the plasmid
pZ3ppαIL-1β. In this case the same protein (interleukin) is functionally fused with
the leader sequence of the S. cerevisiae α-factor pheromone. As previously
described, cells were shake flask cultured in rich YPD or in minimal YNB
medium, and samples were collected and prepared for protein SDS-PAGE separation.
The Western Blot (Fig. 6a) once more revealed the surprisingly better secretion
occurring in Z. bailii compared to S. cerevisiae: the signals obtained from the
crude extracts (i for YPD, iii for YNB medium) are more intense in the latter
strain, suggesting that the product is retained for a shorter time and is
therefore more efficiently secreted from Z. bailii cells. This observation is
consistent with the fact that the signals corresponding to the product secreted
into the medium are more intense in Z. bailii samples than in S. cerevisiae ones
(ii for YPD, iv for YNB medium; in this case a positive signal is present only in
Z. bailii samples).
Importantly, the process of expression, secretion and accumulation of
heterologous proteins in the culture medium can be obtained not only by changing
the leader sequence, but also by utilising the same leader sequence and changing
the heterologous protein expressed. Z. bailii cells were transformed with the
plasmid pZ3ppαGFP and shake flask cultured in minimal YNB medium, and samples
were collected and prepared for protein SDS-PAGE separation. The Western Blot
analyses, performed as previously described except for the primary antibody
utilised (anti-GFP, Clontech, Inc.) and its dilution (1:500), show a band of
the expected size that is present only in the supernatant of the Z. bailii cells
expressing the heterologous GFP protein (Fig. 6b) and not in the control strain
transformed with the empty plasmid.

The data obtained underline the possibility of utilising Z. bailii as a host for
expressing different heterologous proteins and secreting them under the direction
of heterologous leader sequences. Remarkably, the levels of secreted
proteins are higher than the levels obtained in S. cerevisiae, and the
difference is even more pronounced in chemically defined culture medium.

Example 4: Expression and secretion of Interleukin 1-β in a Z. bailii bioreactor
batch cultivation with high sugar concentration.

Z. bailii cells transformed (according to Example 2) with the plasmid pZ3klIL-1β
(Fig. 2a) and previously analysed for interleukin 1-β production in shake flask
culture (see Example 3) were batch cultivated in a 2 l laboratory bioreactor
(fermentor, Biolafitte & Moritz, Mod. Prelude, France) in a chemically defined
medium with high glucose content: 27% w/V glucose, 4% w/V (NH4)2SO4, 0.4%
w/V MgSO4, 2.4% w/V KH2PO4, vitamins according to Verduyn, C., et al., 1992,
Yeast 8, 501-17, at 24 times the indicated concentrations, and trace elements
according to the same reference, also at 24 times the indicated concentrations.
(Depending on the salt tolerance of the production strain, it might be useful in
this context to add only part of the salts with the glucose to the initial
medium and to add the rest of the salts after the bioreaction (fermentation) has
proceeded for a sufficient amount of time.) The pH was controlled at pH 5 by the
addition of 2 M KOH. G418 was added to a concentration of 200 mg/l, and
antifoam was added as necessary. The inoculum was prepared by pre-growing the
yeast in shake flasks (with a headspace-to-culture volume ratio of 4) in YPD rich
medium (see above) with the addition of 200 mg/l G418. Cells were harvested,
washed with deionised water and inoculated in the final medium in the bioreactor
at OD 1.68. The culture was flushed with 90 l/h of air, and the dissolved oxygen
concentration was maintained at 40% of air saturation by varying the stirrer speed.
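
The % w/V figures of the batch medium can be converted into weighed amounts with a few lines of arithmetic. The following is a minimal sketch, not part of the described process: it only uses the identity 1% w/V = 10 g/l, and the 1.5 l working volume in the usage line is a hypothetical value, since the text states only the 2 l vessel size.

```python
# Sketch: convert the % w/V composition of the high-glucose medium into grams.
# 1% w/V means 1 g per 100 ml, i.e. 10 g per litre.

MEDIUM_PERCENT_WV = {
    "glucose": 27.0,
    "(NH4)2SO4": 4.0,
    "MgSO4": 0.4,
    "KH2PO4": 2.4,
}


def grams_per_litre(percent_wv):
    """Return the concentration in g/l for a given % w/V value."""
    return percent_wv * 10.0


def batch_recipe(volume_l):
    """Return grams of each component needed for volume_l litres of medium."""
    if volume_l <= 0:
        raise ValueError("volume must be positive")
    return {
        name: round(grams_per_litre(pct) * volume_l, 3)
        for name, pct in MEDIUM_PERCENT_WV.items()
    }


if __name__ == "__main__":
    # 1.5 l is an assumed working volume, used for illustration only.
    for name, grams in batch_recipe(1.5).items():
        print(f"{name}: {grams} g")
```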
Fig. 7a shows the growth kinetics (cell density, OD 660 nm), together with the
glucose consumption, the ethanol production and the biomass produced (dry
weight, g/l). The glucose consumption and the ethanol production were determined
by using commercial enzymatic kits (Boehringer Mannheim GmbH, Germany, Kits
Cat # 716251 and 0176290, respectively), according to the manufacturer's
instructions. The determination of the cellular dry weight (biomass) was
performed as described before (Rodrigues, F., et al., 2001, Appl. Environ.
Microbiol. 67, 2123-8). Samples were collected at the indicated times and
prepared for protein SDS-PAGE separation. The Western Blot analysis
(performed as described in Example 3) shows a very strong and clean signal,
accumulating over time, corresponding to the secreted product (lanes 2 to 5), and
confirms the minimal retention of the heterologous protein within the cells
(lanes 6 to 9, Fig. 7b). This example shows the surprising and advantageous
ability of Z. bailii cells to grow as well as to express and secrete a
heterologous protein even at very high sugar concentrations. Reportedly,
S. cerevisiae does not grow, or grows only very poorly, at such high
sugar concentrations (see for example Porro, D., et al., 1991, Res. Microbiol. 142,
535-9).

WO 2004/042036
PCT/EP2003/012377



Example 5: Expression and secretion of Glucoamylase in Z. bailii.

Z. bailii cells were transformed (according to Example 2) with the plasmid
pZ3GAA (Fig. 3), and with the empty plasmid pZ3 as a control. Independent
transformants were shake flask cultured in minimal YNB medium with 2% w/V
glucose as carbon source (+0.67% w/V YNB and amino acids, according to the
manufacturer's protocol) until mid-exponential phase (also referred to as
mid-log). The β-glucoamylase activity was determined as follows: after cell
density determination, the cells were harvested in order to recover the culture
supernatant. 15 µl/ml 3 M NaAc, pH 5.2, and 20 µl/ml 1% w/V starch (Fluka 85642,
high solubility) were added. Subsequently, the samples were mixed well and
incubated at the desired temperature (in this experiment: 50°C). At time zero and
every 20 min thereafter, 1 ml of the incubated medium was taken and ice-cooled
for 2 min; 50 µl of Lugol solution (Fluka 62650) was added, and the sample was
shaken quickly and read in the spectrophotometer at λ 580 nm. The slope of the
resulting values over time corresponds to the glucoamylase activity. Fig. 8 shows
the glucoamylase activity of three independent clones expressing the GAA and one
negative control. The enzymatic activity is expressed in mU/OD, and it is
calculated considering that 1 U corresponds to a variation of 1 OD in 1 min. The
values reported in the graph are net of the basal activity level of Z. bailii, as
measured in the control sample.
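
The slope-based calculation described above can be sketched as follows. This is an illustrative reading of the text, not the authors' exact computation: the slope is obtained here by an ordinary least-squares fit, its absolute value is taken on the assumption that the starch-iodine absorbance falls as starch is degraded, the result is normalised to the culture OD to give mU/OD, and the control value is subtracted. Function and parameter names are hypothetical.

```python
# Sketch: glucoamylase activity from A580 readings taken every 20 min.
# 1 U is taken as a change of 1 OD unit per minute, as stated in the text.


def slope(times_min, absorbances):
    """Least-squares slope of absorbance versus time (OD units per minute)."""
    n = len(times_min)
    if n < 2 or n != len(absorbances):
        raise ValueError("need at least two matched time/absorbance points")
    mean_t = sum(times_min) / n
    mean_a = sum(absorbances) / n
    sxx = sum((t - mean_t) ** 2 for t in times_min)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (a - mean_a)
              for t, a in zip(times_min, absorbances))
    return sxy / sxx


def activity_mu_per_od(times_min, a580, culture_od, control_mu_per_od=0.0):
    """Activity in mU/OD, net of the control strain's basal activity."""
    if culture_od <= 0:
        raise ValueError("culture OD must be positive")
    rate_u = abs(slope(times_min, a580))  # assumption: signal decreases
    return rate_u * 1000.0 / culture_od - control_mu_per_od


if __name__ == "__main__":
    # Hypothetical readings, for illustration only.
    times = [0, 20, 40, 60]
    readings = [1.0, 0.9, 0.8, 0.7]
    print(activity_mu_per_od(times, readings, culture_od=2.0))
```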

Z. bailii and S. cerevisiae cells were transformed (according to Example 2) with
the plasmids pZ3STA2 and pZ3klSTA2, and with the empty plasmid pZ3 as a
control. Independent transformants were shake flask cultured in minimal YNB
medium with 2% w/V fructose as carbon source (+0.67% w/V YNB and amino acids,
according to the manufacturer's protocol) until mid-exponential phase (also
referred to as mid-log). The α-glucoamylase activity was determined according to
the literature (Modena et al., 1986, Arch. Biochem. Biophys. 248, 138-50) as
follows: after cell density determination, the cells were harvested in order to
recover the culture supernatant, and an aliquot of said supernatant was used for
preparing the following reaction mix:
Supernatant             100 µl
Maltotriose 400 mM      6.3 µl
NaAc 200 mM pH 4.6      125 µl
H2O                     18.7 µl
total                   250 µl

The mix is incubated for 1 hour at 37°C under slow agitation, and after that time
an aliquot of said mixture is used to evaluate the reaction. The product of
maltotriose degradation is glucose, and its concentration can be determined using
a commercially available enzymatic kit from Boehringer Mannheim GmbH,
Germany (Cat # 716251). 1 U of glucoamylase specific activity is the quantity of
enzyme necessary to release 1 nmol of glucose per minute under said conditions.
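
As a cross-check of the reaction mix, the final concentrations follow from the usual dilution relation (stock concentration x volume added / total volume), and the unit definition is a simple rate. The sketch below assumes nothing beyond the volumes and stock concentrations listed in the mix; the glucose amount in the usage line is a hypothetical reading.

```python
# Sketch: final concentrations in the 250 µl reaction mix and activity in U,
# where 1 U releases 1 nmol of glucose per minute.

REACTION_MIX_UL = {
    "supernatant": 100.0,
    "maltotriose 400 mM": 6.3,
    "NaAc 200 mM pH 4.6": 125.0,
    "water": 18.7,
}


def total_volume_ul(mix):
    """Sum of all component volumes in µl."""
    return sum(mix.values())


def final_conc_mm(stock_mm, volume_ul, total_ul):
    """Final concentration (mM) after dilution into the total volume."""
    return stock_mm * volume_ul / total_ul


def units(glucose_nmol, minutes):
    """Activity in U for a given amount of glucose released in a given time."""
    if minutes <= 0:
        raise ValueError("incubation time must be positive")
    return glucose_nmol / minutes


if __name__ == "__main__":
    total = total_volume_ul(REACTION_MIX_UL)
    print(round(total, 1), "µl total")
    print(round(final_conc_mm(400.0, 6.3, total), 2), "mM maltotriose")
    print(round(final_conc_mm(200.0, 125.0, total), 1), "mM NaAc")
    # 600 nmol glucose after the 60 min incubation is a made-up example value.
    print(units(600.0, 60.0), "U")
```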

Example 6: Expression of β-galactosidase (β-gal) in Z. bailii
Z. bailii cells were transformed (according to Example 2) with the plasmid
pZ3LacZ (Fig. 3b), with the plasmid pZ3bTLacZ (Fig. 4b), with the plasmid
pZ3rGLacZ, and with the empty plasmid pZ3 as a control. Independent
transformants were shake flask cultured in YPD medium (see description above)
with 2% w/V glucose as carbon source until mid-exponential phase.
β-galactosidase activity determination: after cell density determination, 1 ml
of culture is harvested into an Eppendorf tube and spun for 5 minutes (to get a
hard pellet); the supernatant is aspirated with a pipette (not with the vacuum
line), and the cells are washed in 1 ml Z buffer without β-mercaptoethanol (BME)
[Z buffer: 16.1 g/l Na2HPO4·7H2O, 5.5 g/l NaH2PO4·H2O, 0.75 g/l KCl,
0.246 g/l MgSO4·7H2O], repelleted and suspended in 150 µl Z buffer
(with BME, 27 µl/10 ml). 50 µl chloroform and 20 µl 0.1% SDS are added, and the
suspension is vortexed vigorously for 15 s. 700 µl of pre-warmed ONPG
(o-nitrophenyl β-D-galactopyranoside, Sigma N-1127, 1 mg/ml in Z+BME) are added,
and the reaction is started at 30°C (20 min to 3 h), keeping track of the time.
When the suspension turns yellow the reaction is stopped by addition of 0.5 ml
of 1 M Na2CO3; after centrifugation for 10 min at maximum speed the sample is
read in the spectrophotometer at λ 420 nm.

Fig. 8b shows the β-gal activity of three independent clones expressing the β-gal
under control of the Z. bailii TPI promoter, two independent clones expressing the
β-gal under control of the S. cerevisiae TPI promoter and one negative control
(see the legend of the figure for indications of the respective clones). The
enzymatic activity is expressed as Miller Units/OD and is calculated according to
the following formula:

\[
\text{Miller Units} = \frac{A_{420} \times 1000}{A_{660} \times \text{time (min)} \times \text{Vol (ml)}}
\]

As is readily visible, expression from the endogenous TPI promoter is much
stronger (4-5 times) than from the respective promoter from S. cerevisiae.
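
The Miller unit calculation and the promoter comparison can be sketched in a few lines. This is an illustrative sketch under stated assumptions: the cell-density reading is passed in as a plain number, the function names are hypothetical, and all readings in the usage section are made-up values, not data from Fig. 8b.

```python
# Sketch: Miller units from the A420 reading, the cell density reading,
# the reaction time in minutes and the culture volume assayed in ml.


def miller_units(a420, cell_density_od, minutes, volume_ml):
    """Return (A420 x 1000) / (cell density x time x volume)."""
    if cell_density_od <= 0 or minutes <= 0 or volume_ml <= 0:
        raise ValueError("cell density, time and volume must be positive")
    return a420 * 1000.0 / (cell_density_od * minutes * volume_ml)


def fold_difference(units_a, units_b):
    """Ratio of two activities, e.g. to compare two promoters."""
    if units_b <= 0:
        raise ValueError("reference activity must be positive")
    return units_a / units_b


if __name__ == "__main__":
    # Hypothetical readings for two clones, for illustration only.
    endogenous = miller_units(a420=0.90, cell_density_od=1.0, minutes=30, volume_ml=1.0)
    reference = miller_units(a420=0.20, cell_density_od=1.0, minutes=30, volume_ml=1.0)
    print(round(endogenous, 2), round(reference, 2),
          round(fold_difference(endogenous, reference), 2))
```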

A similar series of experiments was performed in order to evaluate the efficiency
of the plasmids based on the sequences of the endogenous Z. bailii plasmid in
improving the expression levels of heterologous proteins. Z. bailii cells were
transformed (according to Example 2) with the following plasmids: pZ3LacZ (Fig.
3b), p195LacZ, pEZ-IALacZ, pEZ-IAFLacZ, pEZ2LacZ and pEZ2-IBLacZ.
Independent transformants were grown until mid-log phase and the β-galactosidase
activity was measured as previously described. The corresponding data are
reported in Fig. 10b.



Example 7: Isolation of an endogenous Z. bailii plasmid

Z. bailii strains ATCC 36947 and NCYC 1427 were cultivated and their
endogenous plasmids were extracted, resulting in the plasmids pZB1 and pZB5 (see
Figs. 9a and 9b). The protocol used was a modification of a protocol by Lorincz,
A., 1985, BRL Focus 6, 11, and uses glass beads to break the cells. After the DNA
extraction, samples were loaded on an agarose gel and the band corresponding to
the plasmid was eluted (Qiagen, QIAquick Gel Extraction Kit, cat. n° 28704).
The plasmid extracted from NCYC 1427 was cut with EcoRI and some of the
fragments were sequenced. These sequences correspond to SEQ ID No.: 63, SEQ
ID No.: 64, SEQ ID No.: 65, SEQ ID No.: 66, SEQ ID No.: 67, SEQ ID No.: 68,
SEQ ID No.: 69 and SEQ ID No.: 70, respectively.

Example 8: Sequence amplification of the open reading frames and of structural
sequences of the endogenous Z. bailii plasmids

The genomic DNA extracted from the Z. bailii strains ATCC 36947 and NCYC
1427 was used as a template for the amplification of the open reading frames and
of structural sequences of the endogenous Z. bailii plasmids.
The oligos for the amplification are the following:
5FLP (SEQ ID NO.: 84)
5'-TAGCTACTCTTCTCCAGGTGTCATTAG-3'       Tm: 63.4
3FLP (SEQ ID NO.: 85)
5'-CCTATGTCCGAGTTTAGCGAGCTTG-3'         Tm: 64.6
5TFC (SEQ ID NO.: 86)
5'-AGAATGAACTCAGAGTTCTCTCTTG-3'         Tm: 59.7
3TFC (SEQ ID NO.: 87)
5'-ATTCTATTGGGTATGTCCCCTG-3'            Tm: 58.4
5TFB (SEQ ID NO.: 88)
5'-GTTTTTAATTTTGAAGCTCACCTTTAATTG-3'    Tm: 58.6
3TFB (SEQ ID NO.: 89)
5'-ATTATGTTCTCCAGGGAAGAGGTTAG-3'        Tm: 61.6
5IRAARS (SEQ ID NO.: 90)
5'-AGAATCAATCATTTAGTGTGGCAGGAG-3'       Tm: 61.9
3IRAARS (SEQ ID NO.: 91)
5'-TAAAAACTGCCCGCCATATTTCGTC-3'         Tm: 61.3



The following program was used for the amplification:
94°C    5 min
94°C    15 s     \
58°C    30 s      } 25 cycles
72°C    2 min    /
72°C    7 min
4°C     ∞

The amplified fragments, sub-cloned into the vector pST-Blue1 (Novagen, Perfect
Blunt Cloning Kit, cat. no. 70191-4), were sequenced and correspond to SEQ ID
No.: 71 (IR-ARS), SEQ ID No.: 72 (FLP), SEQ ID No.: 74 (TFB) and SEQ ID
No.: 76 (TFC), respectively.

These coding sequences are used for the construction of the expression plasmid
pEZ1, according to Figure 9b.

Example 9: Construction of expression plasmids based on replication and
stability sequences from the Z. bailii pSB2 plasmid

The backbone of the new vectors is the basic S. cerevisiae multicopy plasmid
YEplac195 (Gietz and Sugino, 1988, Gene 74, 527-34) modified to the expression
plasmid pBR195, as described in Branduardi (2002, Yeast 19, 1165-70).
For the construction of the plasmid p195, the plasmid pBR195 was cut
AatII/ApaI-blunt in order to excise the URA marker, and the KanR cassette,
excised SphI/SacI-blunt from pFA6-KanMX4 (Wach et al., 1994, Yeast 10,
1793-1808), was inserted in its place. From this plasmid derives the plasmid
p195LacZ: the LacZ gene was sub-cloned from the plasmid pZ3LacZ, cut SphI/NheI,
into the new plasmid p195, opened with the same enzymes.

For the construction of the plasmids pEZ-IA and pEZ-IALacZ, the plasmids p195
and p195LacZ were opened NarI/StuI-blunt, in order to remove the S. cerevisiae
2µm-ori. The PCR fragment corresponding to the IR-A and ARS sequence from
the pSB2 (see previous example for amplification details) was excised EcoRI-blunt
from the pST-Blue1 plasmid and sub-cloned into the opened vectors just
described.

For the construction of the plasmid pEZ-IAFLacZ, the plasmid pEZ-IALacZ was
opened with SmaI, and the fragment corresponding to the FLP and the sequence
containing its promoter, excised from the pST-Blue1 plasmid opened
AccI-blunt/SnaBI, was sub-cloned there. Said sequence was PCR amplified from the
genomic DNA extracted from the Z. bailii strain ATCC 36947.
The oligos for the amplification are the following:
pFLP (SEQ ID NO.: 95)
5'-ACGCAAGAGAGAACTCTGAGTTCAT-3'         Tm: 61.3
3FLP (SEQ ID NO.: 85)
5'-CCTATGTCCGAGTTTAGCGAGCTTG-3'         Tm: 64.6

The following program was used for the amplification:
94°C    5 min
94°C    15 s          \
58°C    30 s           } 29 cycles
72°C    1 min 30 s    /
72°C    7 min
4°C     ∞


For the construction of the plasmids pEZ2 and pEZ2LacZ, the plasmids pEZ-IA
and pEZ-IALacZ were opened with SmaI, and the PCR fragment corresponding to the
sequences of FLP and TFC and the respective promoters was excised
SnaBI/SalI-blunt from the pST-Blue1 plasmid and sub-cloned into the opened
vectors just described.

The oligos for the amplification are the following:
5FLP (SEQ ID NO.: 84)
5'-TAGCTACTCTTCTCCAGGTGTCATTAG-3'       Tm: 63.4
3TFC (SEQ ID NO.: 87)
5'-ATTCTATTGGGTATGTCCCCTG-3'            Tm: 58.4

The following program was used for the amplification:
94°C    5 min
94°C    15 s          \
58°C    30 s           } 25 cycles
72°C    1 min 30 s    /
72°C    7 min
4°C     ∞
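
As a rough planning aid, the nominal run time of a cycling program such as the one above (5 min at 94°C; 25 cycles of 15 s, 30 s and 1 min 30 s; 7 min at 72°C) can be added up as sketched below. The calculation ignores ramping between temperatures and the final 4°C hold, so real instrument times will be longer; the function name is hypothetical.

```python
# Sketch: nominal duration of a three-step PCR program, excluding ramp times
# and the final hold.


def program_minutes(initial_s, cycle_steps_s, cycles, final_extension_s):
    """Return total programmed time in minutes."""
    if cycles < 0:
        raise ValueError("cycle count must not be negative")
    total_s = initial_s + cycles * sum(cycle_steps_s) + final_extension_s
    return total_s / 60.0


if __name__ == "__main__":
    # 94°C 5 min; 25 x (94°C 15 s, 58°C 30 s, 72°C 1 min 30 s); 72°C 7 min
    print(program_minutes(300, (15, 30, 90), 25, 420))
    # Same steps with 29 cycles, as used for the pFLP/3FLP amplification
    print(program_minutes(300, (15, 30, 90), 29, 420))
```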





To obtain the plasmid pEZ2 an additional cloning step was required, in order
to re-insert the polyA: the polyA was excised NaeI/NheI-blunt from the plasmid
pYX022 and was sub-cloned into the intermediate plasmid, opened BamHI-blunt and
de-phosphorylated.

For the construction of the plasmid pEZ2-IBLacZ, the plasmid pEZ2LacZ was
opened SalI-blunt and de-phosphorylated, and the fragment IR-B was sub-cloned
therein. That fragment was excised EcoRI-blunt from pST-Blue1 (see previous
example).

Example 10: Plasmid stability determination 

The stability of the plasmids described in the previous example was determined as
follows: independent Z. bailii transformants bearing the different plasmids were
inoculated at a cellular density of 5x10^3 cells/ml in rich medium (YPD) and in
rich selective medium (YPD + G418), respectively. At T0 of the inoculum and then
after 10 and 20 generations, 500 cells from each culture were plated 3 times on
selective and non-selective agar plates, and subsequently incubated at 30°C until
the colonies became visible. The ratio between the mean colony number grown on
selective medium and the mean colony number grown on non-selective
medium, expressed as a percentage, gives the mitotic stability.
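
The stability calculation can be written out as a short sketch. It follows the definition given above (mean colony count on selective plates divided by mean colony count on non-selective plates, as a percentage); the function name and the colony counts in the usage line are hypothetical.

```python
# Sketch: mitotic stability from triplicate colony counts.


def mitotic_stability(selective_counts, nonselective_counts):
    """Percentage of plated cells that still carry the plasmid marker."""
    if not selective_counts or not nonselective_counts:
        raise ValueError("both count lists must be non-empty")
    mean_selective = sum(selective_counts) / len(selective_counts)
    mean_nonselective = sum(nonselective_counts) / len(nonselective_counts)
    if mean_nonselective == 0:
        raise ValueError("no colonies on non-selective plates")
    return 100.0 * mean_selective / mean_nonselective


if __name__ == "__main__":
    # Made-up triplicate counts after 10 generations, for illustration only.
    print(round(mitotic_stability([310, 295, 305], [480, 500, 490]), 1))
```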
Claims:



1. Process for the production of a protein comprising

a) culturing a Zygosaccharomyces bailii strain

b) expressing and secreting the protein

c) isolating the protein.

2. The process of claim 1, wherein the Z. bailii strain is transformed with a
vector comprising a DNA sequence coding for the protein, functionally linked
to a signal sequence leading to the secretion of the protein and further
functionally linked to a promoter.

3. The process of claim 2, wherein the vector is an extra-chromosomal plasmid.

4. The process of claim 3, wherein the plasmid is derived from an endogenous
episomal plasmid from a Z. bailii strain.

5. The process of claim 2, wherein the plasmid comprises sequences for
replication, stabilization and/or plasmid copy number control, obtainable from
Z. bailii.

6. The process of claim 4, wherein the plasmid comprises at least 35 bases of one
of the sequences selected from the list of: SEQ ID No.: 63, SEQ ID No.: 64,
SEQ ID No.: 65, SEQ ID No.: 66, SEQ ID No.: 67, SEQ ID No.: 68, SEQ ID
No.: 69, SEQ ID No.: 70 or SEQ ID No.: 71.

7. The process of claims 2-6, wherein the promoter is a triose-phosphate
isomerase promoter, obtainable from Saccharomyces cerevisiae or from Z.
bailii, preferably from Z. bailii.

8. The process of claims 2-6, wherein the promoter is a glyceraldehyde
phosphate dehydrogenase promoter, obtainable from Saccharomyces
cerevisiae, Z. bailii or Z. rouxii, preferably from Z. rouxii.

9. The process of claims 2-8, wherein the signal sequence is a continuous stretch
of 15 to 60 amino acids, comprising one or more positively charged amino
acid(s) followed by a stretch of about 5 to 10 hydrophobic amino acids, which
may or may not be interrupted by non-hydrophobic residues.

10. The process of claims 2-8, wherein the signal sequence is selected from the list
of: SEQ ID NO.: 1, SEQ ID NO.: 3, SEQ ID NO.: 5, SEQ ID NO.: 7, SEQ ID
NO.: 9, SEQ ID NO.: 11, SEQ ID NO.: 13, SEQ ID NO.: 15, SEQ ID NO.: 17,
SEQ ID NO.: 19, SEQ ID NO.: 21, SEQ ID NO.: 23, SEQ ID NO.: 25, SEQ
ID NO.: 27, SEQ ID NO.: 29, SEQ ID NO.: 31, SEQ ID NO.: 33, SEQ ID
NO.: 35, SEQ ID NO.: 37, SEQ ID NO.: 39, SEQ ID NO.: 41, SEQ ID NO.:
43, SEQ ID NO.: 45, SEQ ID NO.: 47, SEQ ID NO.: 49, SEQ ID NO.: 51,
SEQ ID NO.: 53, SEQ ID NO.: 55, SEQ ID NO.: 57, SEQ ID NO.: 59, SEQ
ID NO.: 61.

11. The process of claim 1, wherein the Z. bailii strain is transformed with a
vector comprising the DNA sequence coding for the protein, functionally
linked to the signalling pre-sequence of the alpha-subunit of the K1 killer
toxin of Kluyveromyces lactis and further functionally linked to the triose-
phosphate isomerase promoter from S. cerevisiae.

12. The process of claim 11, wherein the vector is the plasmid pZ3kl as shown in
figure 1b.

13. The process of claim 1, wherein the Z. bailii strain is transformed with a
vector comprising the DNA sequence coding for the protein, functionally
linked to the signal sequence of the pre-pro α-factor of S. cerevisiae and
further functionally linked to the triose-phosphate isomerase promoter from S.
cerevisiae.

14. The process of claim 13, wherein the vector is the plasmid pZ3ppα as shown
in figure 1c.

15. The process of claims 2-14, wherein the DNA sequence coding for the protein 
is derived from animal, bacterial, fungal, plant or viral sources. 

16. The process of claims 2-15, wherein the Z. bailii strain that is transformed is
selected from the list of: ATCC 36947, ATCC 60483, NCYC 1427 or ATCC
8766.

17. The process of one of the preceding claims, wherein the Z. bailii strain has
been subjected to a selection process for improved secretion.

18. The process of one of the preceding claims, wherein the Z. bailii strain is 
cultivated in a chemically defined medium. 

19. The process of one of the preceding claims, wherein the protein is isolated 
from the culture medium. 

20. A Z. bailii strain, expressing and secreting a heterologous protein.

21. The Z. bailii strain of claim 20, wherein the cells are transformed with a vector
comprising a DNA sequence coding for the heterologous protein, functionally
linked to a signal sequence leading to the secretion of the protein and further
functionally linked to a promoter.

P779.ST25
SEQUENCE LISTING

<110> Porro, Danilo 

<120> Process for expression and secretion of proteins by the non-conventional 
yeast Zygosaccharomyces bailii 

<130> p 779wo 

<150> DE 10252245.6 
<151> 2003-11-07 

<160> 95 

<170> PatentIn version 3.1

<210> 1 

<211> 57 

<212> DNA 

<213> Saccharomyces cerevisiae 
<220> 

<221> sig_peptide 

<222> (1)..(57) 
<223> 

<400> 1 

cgacagtaaa ttttgccgaa tttaatagct tctactgaaa aacagtggac catgtga 57 

<210> 2 

<211> 19 

<212> PRT 

<213> Saccharomyces cerevisiae 
<220> 

<221> SIGNAL 

<222> (1)..(19)
<223>

<400> 2

Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1               5                   10                  15

Ala Leu Ala

<210> 3 
<211> 255 
<212> DNA 

<213> Saccharomyces cerevisiae

<220>
<221> sig_peptide
<222> (1)..(255)
<223>



<400> 3 

cgacagtaaa ttttgccgaa tttaatagct tctactgaaa aacagtggac catgtgaaaa 60 

gatgcatctc atttatcaaa cacataatat tcaagtgagc cttacttcaa ttgtattgaa 120 

gtgcaagaaa accaaaaagc aacaacaggt tttggataag tacatatata agagggcctt 180 

ttgttcccat caaaaatgtt actgttctta cgattcattt acgattcaag aatagttcaa 240 

acaagaagat tacaa 255 

<210> 4 

<211> 85 

<212> PRT 

<213> Saccharomyces cerevisiae 
<220> 

<221> SIGNAL 

<222> (1)..(85) 
<223> 



<220>
<221> PROPEP
<222> (20)..(85)
<223>

<400> 4

Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser
1               5                   10                  15

Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln
            20                  25                  30

Ile Pro Ala Glu Ala Val Ile Gly Tyr Leu Asp Leu Glu Gly Asp Phe
        35                  40                  45

Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu
    50                  55                  60

Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val
65                  70                  75                  80

Ser Leu Asp Lys Arg
                85



<210> 5 

<211> 63 

<212> DNA 

<213> Aspergillus niger 



<220> 

<221> sig_peptide
<222> (1)..(63)
<223>

<400> 5

atgatggtcg cgtggtggtc tctatttctg tacggccttc aggtcgcggc acctgctttg 60
gct 63



<210> 6
<211> 21
<212> PRT
<213> Aspergillus niger

<220>
<221> SIGNAL
<222> (1)..(21)
<223>

<400> 6

Met Met Val Ala Trp Trp Ser Leu Phe Leu Tyr Gly Leu Gln Val Ala
1               5                   10                  15

Ala Pro Ala Leu Ala
            20

<210> 7
<211> 93
<212> DNA
<213> Bacillus sp.

<220>
<221> sig_peptide
<222> (1)..(93)
<223>

<400> 7 

atgcaaaaca cagcgaaaaa ctccatctgg cagagggtgc gccacagcgc cattgcctta 60 
tccgctctca gtttatcctt tggcctgcag gcc 93 

<210> 8 

<211> 31 

<212> PRT 

<213> Bacillus sp. 



<220> 

<221> SIGNAL 
<222> (1)..(31)
<223>

<400> 8

Met Gln Asn Thr Ala Lys Asn Ser Ile Trp Gln Arg Val Arg His Ser
1               5                   10                  15

Ala Ile Ala Leu Ser Ala Leu Ser Leu Ser Phe Gly Leu Gln Ala
            20                  25                  30

<210> 9 
<211> 63 
<212> DNA 

<213> Aspergillus oryzae 



<220> 

<221> sig_peptide 

<222> (1)..(63)
<223>

<400> 9

atgcagtcca tcaagcgtac cttgctcctc ctcggagcta tccttcccgc ggtcctcggt 60
gcc 63

<210> 10
<211> 21
<212> PRT
<213> Aspergillus oryzae



<220> 

<221> SIGNAL 

<222> (1)..(21) 
<223> 



<400> 10 

Met Gln Ser Ile Lys Arg Thr Leu Leu Leu Leu Gly Ala Ile Leu Pro
1               5                   10                  15

Ala Val Leu Gly Ala
            20

<210> 11
<211> 75
<212> DNA
<213> Bacillus amyloliquefaciens

<220>
<221> sig_peptide
<222> (1)..(75) 
<223> 



<400> 11 

atgaaacgag tgttgctaat tcttgtcacc ggattgttta tgagtttgtg tgggatcact 60 
tctagtgttt cggct 75 

<210> 12 

<211> 25 

<212> PRT 

<213> Bacillus amyloliquefaciens 



<220> 

<221> SIGNAL 
<222> (1)..(25)
<223>

<400> 12

Met Lys Arg Val Leu Leu Ile Leu Val Thr Gly Leu Phe Met Ser Leu
1               5                   10                  15

Cys Gly Ile Thr Ser Ser Val Ser Ala
            20                  25

<210> 13 
<211> 51 
<212> DNA 

<213> Saccharomycopsis fibuligera

<220>
<221> sig_peptide
<222> (1)..(51)
<223>

<400> 13

atgttgatga tagtacagct tttggtcttt gcactaggcc ttgctgttgc t 51



<210> 14 

<211> 17 

<212> PRT 

<213> Saccharomycopsis fibuligera

<220>
<221> SIGNAL
<222> (1)..(17)
<223>

<400> 14

Met Leu Met Ile Val Gln Leu Leu Val Phe Ala Leu Gly Leu Ala Val
1               5                   10                  15

Ala



<210> 15 

<211> 51 

<212> DNA 

<213> Saccharomycopsis fibuligera 



<220> 

<221> sig_peptide 

<222> (1)..(51)
<223>

<400> 15

atgttgttga ttttggaact cttagtactt attatagggc ttggagttgc t 51



<210> 16
<211> 17
<212> PRT
<213> Saccharomycopsis fibuligera

<220>
<221> SIGNAL
<222> (1)..(17)
<223>

<400> 16

Met Leu Leu Ile Leu Glu Leu Leu Val Leu Ile Ile Gly Leu Gly Val
1               5                   10                  15

Ala



<210> 17 

<211> 66 

<212> DNA 

<213> Hypocrea pecorina 



<220> 

<221> sig_peptide 
<222> (1)..(66)
<223>

<400> 17

atggcgccct cagttacact gccgttgacc acggccatcc tggccattgc ccggctcgtc 60
gccgcc 66

<210> 18
<211> 22
<212> PRT
<213> Hypocrea pecorina



<220> 

<221> SIGNAL 
<222> (1)..(22)
<223>

<400> 18

Met Ala Pro Ser Val Thr Leu Pro Leu Thr Thr Ala Ile Leu Ala Ile
1               5                   10                  15

Ala Arg Leu Val Ala Ala
            20

<210> 19 
<211> 63 
<212> DNA 

<213> Hypocrea pecorina 



<220> 

<221> sig_peptide

<222> (1)..(63) 
<223> 

<400> 19 

atgaacaagt ccgtggctcc attgctgctt gcagcgtcca tactatatgg cggcgccgtc 60 

gca 63 

<210> 20 

<211> 21 

<212> PRT 

<213> Hypocrea pecorina 



<220> 

<221> SIGNAL 
<222> (1)..(21)
<223>

<400> 20

Met Asn Lys Ser Val Ala Pro Leu Leu Leu Ala Ala Ser Ile Leu Tyr
1               5                   10                  15

Gly Gly Ala Val Ala
            20

<210> 21 

<211> 48 

<212> DNA 

<213> Arxula adeninivorans 
<220> 

<221> sig_peptide 

<222> (1)..(48)
<223> 

<400> 21 

atgcgtcagt ttctagcact tgctgctgct gcttccatag cggtggca 48 

<210> 22 

<211> 18 

<212> PRT 

<213> Arxula adeninivorans 
<220> 

<221> SIGNAL 

<222> (1)..(18) 
<223> 

<400> 22

Met Arg Gln Phe Leu Ala Leu Ala Ala Ala Ala Ser Ile Ala Val Ala
1               5                   10                  15

Asp Ser

<210> 23 

<211> 57 

<212> DNA 

<213> Homo sapiens 

<220>
<221> sig_peptide
<222> (1)..(57)
<223>

<400> 23

atgcagcgac tatgtgtgta tgtgctgatc tttgcactgg ctctggccgc cttctct 57



<210> 24 

<211> 19 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> (1)..(19)
<223>

<400> 24

Met Gln Arg Leu Cys Val Tyr Val Leu Ile Phe Ala Leu Ala Leu Ala
1               5                   10                  15

Ala Phe Ser



<210> 25 

<211> 75 

<212> DNA 

<213> Rhizopus oryzae 
<220> 

<221> sig_peptide 

<222> (1)..(75) 
<223> 



<400> 25 

atgcaactgt tcaatttgcc attgaaagtt tcattctttc tcgtcctctc ttacttttct 60
ttgctcgttt ctgct 75

<210> 26 

<211> 25 

<212> PRT 

<213> Rhizopus oryzae 
<220> 

<221> SIGNAL 

<222> (1)..(25)
<223>

<400> 26

Met Gln Leu Phe Asn Leu Pro Leu Lys Val Ser Phe Phe Leu Val Leu
1 5 10 15

Ser Tyr Phe Ser Leu Leu Val Ser Ala
20 25

<210> 27 
<211> 48 
<212> DNA 

<213> Aspergillus niger 
<220> 

<221> sig_peptide
<222> (1)..(48)
<223> 

<400> 27 

atgcagactc tccttgtgag ctcgcttgtg gtctccctcg ctgcggcc 48 

<210> 28 

<211> 16 

<212> PRT 

<213> Aspergillus niger 
<220> 

<221> SIGNAL
<222> (1)..(16)
<223>

<400> 28

Met Gln Thr Leu Leu Val Ser Ser Leu Val Val Ser Leu Ala Ala Ala
1 5 10 15



<210> 29 
<211> 54 
<212> DNA 



<213> Homo sapiens 
<220> 

<221> sig_peptide

<222> (1)..(54)
<223>

<400> 29

atgaagtggg taacctttat ttcccttctt tttctcttta gctcggctta ttcc 54



<210> 30 

<211> 18 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> (1)..(18)
<223>

<400> 30

Met Lys Trp Val Thr Phe Ile Ser Leu Leu Phe Leu Phe Ser Ser Ala
1 5 10 15

Tyr Ser



<210> 31

<211> 78

<212> DNA

<213> Saccharomyces cerevisiae

<220>

<221> sig_peptide

<222> (1)..(78)
<223> 



<400> 31 

atgacgaagc caacccaagt attagttaga tccgtcagta tattattttt catcacatta 60 
ctacacctag tcgtagcg 78 

<210> 32 

<211> 26 

<212> PRT 

<213> Saccharomyces cerevisiae 



<220> 

<221> SIGNAL 

<222> (1)..(26)
<223>

<400> 32

Met Thr Lys Pro Thr Gln Val Leu Val Arg Ser Val Ser Ile Leu Phe
1 5 10 15

Phe Ile Thr Leu Leu His Leu Val Val Ala
20 25

<210> 33 
<211> 63 

<212> DNA 

<213> Saccharomyces cerevisiae 



<220> 

<221> sig_peptide
<222> (1)..(63)
<223>

<400> 33

atgggccact tagcgatcct tttcagtatt atcgctgtat tgaatatagc tacagctgtt 60

gca 63



<210> 34 

<211> 21 

<212> PRT 

<213> Saccharomyces cerevisiae 
<220> 

<221> SIGNAL 

<222> (1)..(21)
<223>

<400> 34

Met Gly His Leu Ala Ile Leu Phe Ser Ile Ile Ala Val Leu Asn Ile
1 5 10 15

Ala Thr Ala Val Ala
20

<210> 35 
<211> 48 
<212> DNA 

<213> Kluyveromyces lactis 
<220> 

<221> sig_peptide 
<222> (1)..(48)
<223> 

<400> 35 

ataaaatgaa tatattttac atatttttgt ttttgctgtc attcgttc 48 



<210> 36

<211> 17
<212> PRT

<213> Kluyveromyces lactis

<220>

<221> SIGNAL
<222> (1)..(17)
<223>

<400> 36

Met Asn Ile Phe Tyr Ile Phe Leu Phe Leu Leu Ser Phe Val Gln Gly
1 5 10 15

Leu



<210> 37 

<211> 69 

<212> DNA

<213> Kluyveromyces lactis
<220>

<221> sig_peptide

<222> (1)..(69)
<223>

<400> 37

ataaaatgaa tatattttac atatttttgt ttttgctgtc attcgttcaa ggtttggagc 60

atactcatc 69

<210> 38

<211> 23

<212> PRT

<213> Kluyveromyces lactis



<220> 

<221> SIGNAL
<222> (1)..(23)
<223>

<400> 38

Met Lys Ile Tyr His Ile Phe Ser Val Cys Tyr Leu Ile Thr Leu Cys
1 5 10 15

Ala Ala Ala Ala Thr Thr Ala
20



<210> 39 

<211> 54 

<212> DNA 

<213> Saccharomyces cerevisiae

<220>

<221> sig_peptide
<222> (1)..(54)
<223>

<400> 39

atgtttgctt tctactttct caccgcatgc atcagtttga agggcgtttt tggg 54



<210> 40 

<211> 18 

<212> PRT 

<213> Saccharomyces cerevisiae

<220>

<221> SIGNAL
<222> (1)..(18)
<223>

<400> 40

Met Phe Ala Phe Tyr Phe Leu Thr Ala Cys Ile Ser Leu Lys Gly Val
1 5 10 15

Phe Gly

<210> 41

<211> 54 

<212> DNA 

<213> Saccharomyces cerevisiae 
<220> 

<221> sig_peptide 

<222> (1)..(54)
<223> 

<400> 41 

atgtttaagt ctgttgttta ttcggttcta gccgctgctt tagttaatgc aggt 54 

<210> 42 

<211> 18 

<212> PRT 

<213> Saccharomyces cerevisiae
<220>

<221> SIGNAL

<222> (1)..(18)
<223>

<400> 42

Met Phe Lys Ser Val Val Tyr Ser Val Leu Ala Ala Ala Leu Val Asn
1 5 10 15

Ala Gly

<210> 43

<211> 51

<212> DNA

<213> Saccharomyces cerevisiae
<220>

<221> sig_peptide
<222> (1)..(51)
<223> 



<400> 43 

atgtttaaat ctgttgttta ttcaatttta gccgcttctt tggccaatgc a 51 

<210> 44 

<211> 17 

<212> PRT 

<213> Saccharomyces cerevisiae 



<220> 

<221> SIGNAL 

<222> (1)..(17)
<223>

<400> 44

Met Phe Lys Ser Val Val Tyr Ser Ile Leu Ala Ala Ser Leu Ala Asn
1 5 10 15

Ala

<210> 45 

<211> 48 

<212> DNA 

<213> Kluyveromyces lactis 



<220> 

<221> sig_peptide 

<222> (1)..(48)
<223> 



<400> 45 

atgctatcta ttctgttgag tttattatca ttatcaggga cccatgcg 48 
<210> 46
<211> 16
<212> PRT

<213> Kluyveromyces lactis
<220>

<221> SIGNAL
<222> (1)..(16)
<223>

<400> 46

Met Leu Ser Ile Leu Leu Ser Leu Leu Ser Leu Ser Gly Thr His Ala
1 5 10 15

<210> 47 

<211> 48 

<212> DNA 

<213> Kluyveromyces lactis 
<220> 

<221> sig_peptide 

<222> (1)..(48)

<223> 



<400> 47 

atgctatcta ttctgttggg tttattatca ctatcaggga cccatgcg 48 

<210> 48 

<211> 16 

<212> PRT 

<213> Kluyveromyces lactis 



<220> 

<221> SIGNAL 
<222> (1)..(16)
<223>

<400> 48

Met Leu Ser Ile Leu Leu Gly Leu Leu Ser Leu Ser Gly Thr His Ala
1 5 10 15

<210> 49 

<211> 54 

<212> DNA 

<213> Aspergillus niger 
<220> 

<221> sig_peptide 

<222> (1)..(54)
<223> 



<400> 49 

atgggcgtct ctgctgttct acttcctttg tatctcctgt ctggagtcac ctcc 54 

<210> 50 

<211> 18 

<212> PRT 

<213> Aspergillus niger 
<220> 

<221> SIGNAL 

<222> (1)..(18)
<223>

<400> 50

Met Gly Val Ser Ala Val Leu Leu Pro Leu Tyr Leu Leu Ser Gly Val
1 5 10 15

Thr Ser



<210> 51 

<211> 57 

<212> DNA 

<213> Saccharomyces cerevisiae
<220>

<221> sig_peptide 
<222> (1)..(57) 
<223> 



<400> 51 

atgcttttgc aagctttcct tttccttttg gctggttttg cagccaaaat atctgca 57 

<210> 52 

<211> 19 

<212> PRT 

<213> Saccharomyces cerevisiae 



<220> 

<221> SIGNAL 
<222> (1)..(19)
<223>

<400> 52

Met Leu Leu Gln Ala Phe Leu Phe Leu Leu Ala Gly Phe Ala Ala Lys
1 5 10 15

Ile Ser Ala



<210> 53 

<211> 63 

<212> DNA 

<213> Saccharomyces cerevisiae 



<220> 

<221> sig_peptide 
<222> (1)..(63)
<223>

<400> 53

atgcaaagac catttctact cgcttatttg gtcctttcac ttctatttaa ctcggctttg 60

ggt 63

<210> 54 
<211> 21 
<212> PRT 

<213> Saccharomyces cerevisiae 
<220> 

<221> SIGNAL 
<222> (1)..(21)
<223>

<400> 54

Met Gln Arg Pro Phe Leu Leu Ala Tyr Leu Val Leu Ser Leu Leu Phe
1 5 10 15

Asn Ser Ala Leu Gly
20

<210> 55 
<211> 63 
<212> DNA 

<213> Saccharomyces cerevisiae 
<220> 

<221> sig_peptide 
<222> (1)..(63)
<223>

<400> 55

atgcaaagac catttctact cgcttatttg gtcctttcgc ttctatttaa ctcagctttg 60
ggt 63

<210> 56

<211> 21

<212> PRT

<213> Saccharomyces cerevisiae

<220> 

<221> SIGNAL 
<222> (1)..(21)
<223>

<400> 56

Met Gln Arg Pro Phe Leu Leu Ala Tyr Leu Val Leu Ser Leu Leu Phe
1 5 10 15

Asn Ser Ala Leu Gly
20

<210> 57 
<211> 96 
<212> DNA 

<213> Saccharomyces cerevisiae 



<220> 

<221> sig_peptide 
<222> (1)..(96)
<223>

<400> 57 

atggtaggcc tcaaaaatcc atatacgcac actatgcaaa gaccatttct actcgcttat 60 
ttggtccttt cgcttctatt taactcagct ttgggt 96 

<210> 58 

<211> 32 

<212> PRT 

<213> Saccharomyces cerevisiae
<220>

<221> SIGNAL

<222> (1)..(32)
<223>

<400> 58

Met Val Gly Leu Lys Asn Pro Tyr Thr His Thr Met Gln Arg Pro Phe
1 5 10 15

Leu Leu Ala Tyr Leu Val Leu Ser Leu Leu Phe Asn Ser Ala Leu Gly
20 25 30

<210> 59 
<211> 63 
<212> DNA 

<213> Zygosaccharomyces bailii

<220>

<221> sig_peptide

<222> (1)..(63)
<223>

<400> 59

atgaaagcag cccaaatatt aacagcaagt atagtaagct tattgccaat atatactagt 60

gct 63

<210> 60

<211> 21

<212> PRT

<213> Zygosaccharomyces bailii



<220> 

<221> SIGNAL 
<222> (1)..(21)
<223>

<400> 60

Met Lys Ala Ala Gln Ile Leu Thr Ala Ser Ile Val Ser Leu Leu Pro
1 5 10 15

Ile Tyr Thr Ser Ala
20



<210> 61

<211> 417

<212> DNA

<213> Zygosaccharomyces bailii
<220>

<221> sig_peptide

<222> (1)..(417)
<223>

<400> 61
atgaaagcag cccaaatatt aacagcaagt atagtaagct tattgccaat atatactagt 60
gctagaaaca tattagacag agaatacaca gcaaacgaat taaaaactgc ttttggagat 120
gaagaaattt ttacagattt gacgtatcac attcacgtta acgtcagtgg cgaaattgac 180
tcttactatc ataatttagt caattttgtc gataacgctc tagcaaacaa agatattaat 240
agatatatat acgctatatt tacacagcag acaaactata cagaggatgg gctcattgag 300
tacttaaatc attacgattc agagacttgc aaagatatca ttactcagta taatgttaac 360
gtagacacta gtaactgtat aagcaatact acagatcaag ctagactcca acgtcgc 417



<210> 62 

<211> 139 

<212> PRT

<213> Zygosaccharomyces bailii
<220>

<221> SIGNAL

<222> (1)..(139)
<223>

<220>

<221> PROPEP

<222> (22)..(139)
<223>

<400> 62

Met Lys Ala Ala Gln Ile Leu Thr Ala Ser Ile Val Ser Leu Leu Pro
1 5 10 15

Ile Tyr Thr Ser Ala Arg Asn Ile Leu Asp Arg Glu Tyr Thr Ala Asn
20 25 30

Glu Leu Lys Thr Ala Phe Gly Asp Glu Glu Ile Phe Thr Asp Leu Thr
35 40 45

Tyr His Ile His Val Asn Val Ser Gly Glu Ile Asp Ser Tyr Tyr His
50 55 60

Asn Leu Val Asn Phe Val Asp Asn Ala Leu Ala Asn Lys Asp Ile Asn
65 70 75 80

Arg Tyr Ile Tyr Ala Ile Phe Thr Gln Gln Thr Asn Tyr Thr Glu Asp
85 90 95

Gly Leu Ile Glu Tyr Leu Asn His Tyr Asp Ser Glu Thr Cys Lys Asp
100 105 110

Ile Ile Thr Gln Tyr Asn Val Asn Val Asp Thr Ser Asn Cys Ile Ser
115 120 125

Asn Thr Thr Asp Gln Ala Arg Leu Gln Arg Arg
130 135

<210> 63 

<211> 587 

<212> DNA 

<213> Zygosaccharomyces bailii

<400> 63
ttatagatct aaaaataaat aatataaatt acctcatcca gagtcccccc ccaaatcctt 60
attctaaata ataagctact cctccccccc caggagtatt tttaggggga ggggggacct 120
taactcaagg gggagtagtt ttgaggatca catgggaagt atttaaataa atagtagttc 180
ttttgtttta aaaaggcctc tccaaaagta atacttttag ggtaattact aagtataata 240
tatattataa gtaatagcct ttatagctta atggtaaagc agtaaattga agatttacct 300
atatgtagtt cgattctcat taagggcaat ataaataagc tttttaatgg gccaatagct 360
gaaataagta atattattgt aaatattgag acttgaactc aaatcttatg cacctaaaaa 420
catatatttt aaccaattaa attatattta ctttattatt tacttatata acttctacta 480
attgtaaagt ataaccagct tttttgttaa caacaaaaac cgagagggtt catgttatat 540
ataatttata attgttctta ctttatttat aaaagaataa ccgaatg 587



<210> 64
<211> 435
<212> DNA

<213> Zygosaccharomyces bailii
<400> 64

ttaatttaat ttgtcgctaa ataataatgt tttaaatatt ataaatattt caaccaacca 60
ccccccccaa aagggggtgg ttggtggttg gtcgtcacca accacctttg gtgggtggtg 120
ccccctatga gttttcatat tataaatata aaaactttta tggagggacc tataagaaat 180
aattgaggaa taattaataa taagttgccc tccttttttt tctcttctcc ccaccctaaa 240
aatactcctg ggggggggag ggagagaatg tatgtagtgg ggagggtgta agttaataat 300
agacttaaat agagttatat aaaataacat aaatatgctt aaaaataata ataataatat 360
taacagatag aagccaaagg gtcaggcgct ttctttggga gaaagagtta gttagttcga 420
atctatccta tctga 435

<210> 65
<211> 299
<212> DNA

<213> Zygosaccharomyces bailii
<400> 65

ttttggggtc gaggtgccgt aaagcactaa atcggaaccc taaagggagc ccccgattta 60
gagcttgacg gggaaagccg gcgaacgtgg cgagaaagga agggaagaaa gcgaaaggag 120
cgggcgctag ggcgctggca agtgtagcgg tcacgctgcg cgtaaccacc acacccgccg 180
cgcttaatgc gccgctacag ggcgcgtcag gtggcacttt tcggggaaat gtgcgcggaa 240
cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataa 299

<210> 66
<211> 153
<212> DNA

<213> Zygosaccharomyces bailii
<400> 66

ctccactgta acatttccca ctgtcctttt cccatctttc attttacaat gagcaagttt 60
cagaaaaaaa aatacaaatg ggataagtgc aaaacattcc atgtatctgt agcttccaat 120
gttattcctc tctccagagt caggcttctg tgt 153

<210> 67
<211> 231
<212> DNA 

<213> Zygosaccharomyces bailii 
<400> 67 

aattcaatag atgatgattt aacttcattt aatgagaaga tttcattaga ttcaacaaaa 60 
tctggagatt ttgcataaac aactgattta ttattagctt tattttctaa tccattaact 120 
aattgatcat acataatata gatgaataag aataatgaaa ctagtgcaat aattgatcca 180 
attgatgcta cataatttca accagcaaag gcatcagggt agtcaggaat t 231 

<210> 68 

<211> 52 

<212> DNA 

<213> Zygosaccharomyces bailii



<400> 68 

ctcgtaaaaa cgagcatgag ctgcgtcagg tcagccgtgg atatcgttgc gg 52 

<210> 69 

<211> 116 

<212> DNA 

<213> Zygosaccharomyces bailii 



<400> 69 

ctatctgcac gtgccaccgg aggtgctgtg ggagcgactg cggcacgatc gccatcggcc 60 
gctgctgcag gtgccgcagc cgaggcagcg cattttcgaa ctctacgccc agcgcg 116 

<210> 70 

<211> 268 

<212> DNA 

<213> Zygosaccharomyces bailii 



<400> 70 

ccggcccctt gttgcgggcg gtgccgatca ggcgcagcga gttgatctcg cgcagccggt 60 

cctgggcata attgagcaaa tcgtactcgt gcgcggcgat gcgctccttg ccgatcgaat 120 

tgacgtaatc gatcgcggcg ccgagcccga tcgcctcgac gatcggcggc gtgccggcct 180 

cgaacttgtg cggcgggtcg ccataggtga cccagtcctt ggcaacttca cggatcattt 240 

cgccgccgcc gttgaacggc cgcatcgc 268

<210> 71
<211> 869
<212> DNA

<213> Zygosaccharomyces bailii

<400> 71
cacacagaaa cagctatgac catgatacgc caagcttaat acgactcact ataggaaagc 60
tcggtaccac gcatgctgca gacgcgttac gtatcggatc cagaattcgt gatattctat 120
tgggtatgtc ccctgattcg acggcgtaaa ttgcgtgaat cttgtgttgg cgctaatgac 180
cgctttttgg aattatgtgc tatgcctctg ccattggtat caacagctga aatatttgtt 240
gaagatcgaa tatcttctat tgtttctgag ggtatccccg aagctatggc gaaagaaagg 300
atctcttctc gtacttggat cggtacgaga agcaatagac gcacaatgca ttgacgcatc 360
ttgttgatac cgggtaatgt gagtcttctg ggttctgtta ttgagtttaa tatgtcgtcc 420
acctctgttc tcgtatccat tttgcgagta gcccgccata cagcacgtcc aatacaggag 480
aggccattta gcttcaggtg cagagaagac acagcatggt gctcaccttc gagtgtctca 540
atagatgatt gagttgactg ggcttccgtg aaagggcctt tcgagagatc ttcagaaata 600
aaccagggtt gcgcttcatt agtaggtgtt cctggaggac tattgtcgct atctgctgga 660
ctactgctac caagtagtga agggggtatt ctaaggcttt cactctgttc tgacactatt 720
ataacattgc caaggccaat ttgaaaggtt tcgcgtatat gagtaaagag ctcggtgccc 780
ttccagttgg aatcaagccg ttcaagcaga tcgagagcat aatcagagtc cacatttccg 840
cacgcaagag agaactctga gttcattct 869



<210> 72 

<211> 1425 

<212> DNA 

<213> Zygosaccharomyces bailii 
<220> 

<221> CDS 

<222> (1)..(1425)

<223>

<400> 72

atg tcc gag ttt agc gag ctt gtc aga att ctc cca tta gac cag gtt 48
Met Ser Glu Phe Ser Glu Leu Val Arg Ile Leu Pro Leu Asp Gln Val
1 5 10 15

gca gaa ata aag cgt att ttg agt cgc ggc gac cct ata cct tta caa 96
Ala Glu Ile Lys Arg Ile Leu Ser Arg Gly Asp Pro Ile Pro Leu Gln
20 25 30

agg tta gct tct cta cta act atg gtg atc cta acg gtc aac atg tca 144
Arg Leu Ala Ser Leu Leu Thr Met Val Ile Leu Thr Val Asn Met Ser
35 40 45

aaa aag agg aag agc tct cca atc aag ctt agc acc ttt act aaa tat 192
Lys Lys Arg Lys Ser Ser Pro Ile Lys Leu Ser Thr Phe Thr Lys Tyr
50 55 60

cgt aga aat gtt gcg aag tca ttg tat tat gat atg tca agc aag aca 240
Arg Arg Asn Val Ala Lys Ser Leu Tyr Tyr Asp Met Ser Ser Lys Thr
65 70 75 80

gta ttc ttc gaa tac cat ctc aaa aat aca caa gat cta cag gag ggc 288
Val Phe Phe Glu Tyr His Leu Lys Asn Thr Gln Asp Leu Gln Glu Gly
85 90 95

ctc gag caa gcc att gcg ccc tac aat ttc gtg gta aag gtg cac aag 336
Leu Glu Gln Ala Ile Ala Pro Tyr Asn Phe Val Val Lys Val His Lys
100 105 110

aag cca att gat tgg cag aaa cag ctc tca agc gtg cat gag agg aaa 384
Lys Pro Ile Asp Trp Gln Lys Gln Leu Ser Ser Val His Glu Arg Lys
115 120 125

gcg ggc cac aga agc att ctc agc aac aat gtt ggc gcc gag atc tct 432
Ala Gly His Arg Ser Ile Leu Ser Asn Asn Val Gly Ala Glu Ile Ser
130 135 140

aaa ctg gct gag acg aaa gat tct act tgg agt ttt atc gag aga aca 480
Lys Leu Ala Glu Thr Lys Asp Ser Thr Trp Ser Phe Ile Glu Arg Thr
145 150 155 160

atg gat ctg ata gaa gcc cgc acc cgc cag ccc acg aca aga gtt gcg 528
Met Asp Leu Ile Glu Ala Arg Thr Arg Gln Pro Thr Thr Arg Val Ala
165 170 175

tat agg ttt ctg ctt caa ctc aca ttc atg aac tgc tgt agg gct aat 576
Tyr Arg Phe Leu Leu Gln Leu Thr Phe Met Asn Cys Cys Arg Ala Asn
180 185 190

gat ttg aaa aac gcc gac ccc agc act ttt caa atc atc gca gat cct 624
Asp Leu Lys Asn Ala Asp Pro Ser Thr Phe Gln Ile Ile Ala Asp Pro
195 200 205

cac ctt ggt cgt ata ttg cgg gcc ttt gtt cca gag aca aag act agc 672
His Leu Gly Arg Ile Leu Arg Ala Phe Val Pro Glu Thr Lys Thr Ser
210 215 220

att gaa agg ttt atc tat ttt ttc cca tgt aag gga cga tgc gat ccg 720
Ile Glu Arg Phe Ile Tyr Phe Phe Pro Cys Lys Gly Arg Cys Asp Pro
225 230 235 240

ctt ttg gct cta gat tcc tat ctc ctg tgg gtt ggc cca gtg ccc aaa 768
Leu Leu Ala Leu Asp Ser Tyr Leu Leu Trp Val Gly Pro Val Pro Lys
245 250 255

act cag act acc gat gaa gag act caa tat gat tac cag ctt ctt caa 816
Thr Gln Thr Thr Asp Glu Glu Thr Gln Tyr Asp Tyr Gln Leu Leu Gln
260 265 270

gat act ctc ttg att tcg tac gac agg ttt atc gcc aaa gaa tca aag 864
Asp Thr Leu Leu Ile Ser Tyr Asp Arg Phe Ile Ala Lys Glu Ser Lys
275 280 285

gaa aat att ttc aaa ata cct aat ggg ccc aaa gct cat ttg ggg cgg 912
Glu Asn Ile Phe Lys Ile Pro Asn Gly Pro Lys Ala His Leu Gly Arg
290 295 300

cat cta atg gca tca tac ctt gga aac aac agt ctc aag agc gag gcc 960
His Leu Met Ala Ser Tyr Leu Gly Asn Asn Ser Leu Lys Ser Glu Ala
305 310 315 320

aca ctc tac ggc aac tgg tct gtg gaa agg caa gag ggc gtc agc aaa 1008
Thr Leu Tyr Gly Asn Trp Ser Val Glu Arg Gln Glu Gly Val Ser Lys
325 330 335

atg gct gac agc cga tac atg cac acg gtt aaa aaa agt cca cct tca 1056
Met Ala Asp Ser Arg Tyr Met His Thr Val Lys Lys Ser Pro Pro Ser
340 345 350

tat cta ttt gca ttt tta tcc ggc tac tac aaa aag tcc aac caa ggc 1104
Tyr Leu Phe Ala Phe Leu Ser Gly Tyr Tyr Lys Lys Ser Asn Gln Gly
355 360 365

gag tac gtg ctg gct gaa aca ctg tat aat ccc ctg gat tac gac aaa 1152
Glu Tyr Val Leu Ala Glu Thr Leu Tyr Asn Pro Leu Asp Tyr Asp Lys
370 375 380

aca ctt cca ata aca acg aac gag aaa ttg atc tgt cgg cgg tac ggg 1200
Thr Leu Pro Ile Thr Thr Asn Glu Lys Leu Ile Cys Arg Arg Tyr Gly
385 390 395 400

aaa aat gcg aaa gtg ata cca aaa gac gca ctg ctg tat ctc tac acg 1248
Lys Asn Ala Lys Val Ile Pro Lys Asp Ala Leu Leu Tyr Leu Tyr Thr
405 410 415

tat gcg cag cag aag cga aaa caa ttg gcc gat ccc aat gag caa aat 1296
Tyr Ala Gln Gln Lys Arg Lys Gln Leu Ala Asp Pro Asn Glu Gln Asn
420 425 430

agg cta ttc agt agt gaa tca cca gcg cat ccc ttc tta act cct caa 1344
Arg Leu Phe Ser Ser Glu Ser Pro Ala His Pro Phe Leu Thr Pro Gln
435 440 445

tcg aca ggc tca tcg aca ccc ttg acc tgg act gct cca aag aca ctc 1392
Ser Thr Gly Ser Ser Thr Pro Leu Thr Trp Thr Ala Pro Lys Thr Leu
450 455 460

tcc act ggt cta atg aca cct gga gaa gag tag 1425
Ser Thr Gly Leu Met Thr Pro Gly Glu Glu
465 470



<210> 73 

<211> 474 

<212> PRT 

<213> Zygosaccharomyces bailii 



<400> 73 

Met Ser Glu Phe Ser Glu Leu Val Arg Ile Leu Pro Leu Asp Gln Val
1 5 10 15

Ala Glu Ile Lys Arg Ile Leu Ser Arg Gly Asp Pro Ile Pro Leu Gln
20 25 30

Arg Leu Ala Ser Leu Leu Thr Met Val Ile Leu Thr Val Asn Met Ser
35 40 45

Lys Lys Arg Lys Ser Ser Pro Ile Lys Leu Ser Thr Phe Thr Lys Tyr
50 55 60

Arg Arg Asn Val Ala Lys Ser Leu Tyr Tyr Asp Met Ser Ser Lys Thr
65 70 75 80

Val Phe Phe Glu Tyr His Leu Lys Asn Thr Gln Asp Leu Gln Glu Gly
85 90 95

Leu Glu Gln Ala Ile Ala Pro Tyr Asn Phe Val Val Lys Val His Lys
100 105 110

Lys Pro Ile Asp Trp Gln Lys Gln Leu Ser Ser Val His Glu Arg Lys
115 120 125

Ala Gly His Arg Ser Ile Leu Ser Asn Asn Val Gly Ala Glu Ile Ser
130 135 140

Lys Leu Ala Glu Thr Lys Asp Ser Thr Trp Ser Phe Ile Glu Arg Thr
145 150 155 160

Met Asp Leu Ile Glu Ala Arg Thr Arg Gln Pro Thr Thr Arg Val Ala
165 170 175

Tyr Arg Phe Leu Leu Gln Leu Thr Phe Met Asn Cys Cys Arg Ala Asn
180 185 190

Asp Leu Lys Asn Ala Asp Pro Ser Thr Phe Gln Ile Ile Ala Asp Pro
195 200 205

His Leu Gly Arg Ile Leu Arg Ala Phe Val Pro Glu Thr Lys Thr Ser
210 215 220

Ile Glu Arg Phe Ile Tyr Phe Phe Pro Cys Lys Gly Arg Cys Asp Pro
225 230 235 240

Leu Leu Ala Leu Asp Ser Tyr Leu Leu Trp Val Gly Pro Val Pro Lys
245 250 255

Thr Gln Thr Thr Asp Glu Glu Thr Gln Tyr Asp Tyr Gln Leu Leu Gln
260 265 270

Asp Thr Leu Leu Ile Ser Tyr Asp Arg Phe Ile Ala Lys Glu Ser Lys
275 280 285

Glu Asn Ile Phe Lys Ile Pro Asn Gly Pro Lys Ala His Leu Gly Arg
290 295 300

His Leu Met Ala Ser Tyr Leu Gly Asn Asn Ser Leu Lys Ser Glu Ala
305 310 315 320

Thr Leu Tyr Gly Asn Trp Ser Val Glu Arg Gln Glu Gly Val Ser Lys
325 330 335

Met Ala Asp Ser Arg Tyr Met His Thr Val Lys Lys Ser Pro Pro Ser
340 345 350

Tyr Leu Phe Ala Phe Leu Ser Gly Tyr Tyr Lys Lys Ser Asn Gln Gly
355 360 365

Glu Tyr Val Leu Ala Glu Thr Leu Tyr Asn Pro Leu Asp Tyr Asp Lys
370 375 380

Thr Leu Pro Ile Thr Thr Asn Glu Lys Leu Ile Cys Arg Arg Tyr Gly
385 390 395 400

Lys Asn Ala Lys Val Ile Pro Lys Asp Ala Leu Leu Tyr Leu Tyr Thr
405 410 415

Tyr Ala Gln Gln Lys Arg Lys Gln Leu Ala Asp Pro Asn Glu Gln Asn
420 425 430

Arg Leu Phe Ser Ser Glu Ser Pro Ala His Pro Phe Leu Thr Pro Gln
435 440 445

Ser Thr Gly Ser Ser Thr Pro Leu Thr Trp Thr Ala Pro Lys Thr Leu
450 455 460

Ser Thr Gly Leu Met Thr Pro Gly Glu Glu
465 470



<210> 74 

<211> 1074 

<212> DNA 

<213> Zygosaccharomyces bailii 



<220> 

<221> CDS 

<222> (1)..(1074)

<223>

<400> 74

atg ttc tcc agg gaa gag gtt agg gcc tcc aaa ccc act aaa gag atg 48
Met Phe Ser Arg Glu Glu Val Arg Ala Ser Arg Pro Thr Lys Glu Met
1 5 10 15

aag atg atc ttt gat gtg ctt atg aca ttt cct tac ttc gcg gta cat 96
Lys Met Ile Phe Asp Val Leu Met Thr Phe Pro Tyr Phe Ala Val His
20 25 30

gtt cct tcc aag aat ata ctt atc aca cca aaa ggc aca gtt gag ata 144
Val Pro Ser Lys Asn Ile Leu Ile Thr Pro Lys Gly Thr Val Glu Ile
35 40 45

cct gaa aac tat caa aat tat ccc ata ttg gcc atc ttc tac gtc aaa 192
Pro Glu Asn Tyr Gln Asn Tyr Pro Ile Leu Ala Ile Phe Tyr Val Lys
50 55 60

tat tta atg aag aaa aat ccg tac gat ctt ctt cca agc acc gtg aac 240
Tyr Leu Met Lys Lys Asn Pro Tyr Asp Leu Leu Pro Ser Thr Val Asn
65 70 75 80

tgg ccg gaa ccc tat gta gtg gtg aat acc atc act aag cgt ttc cag 288
Trp Pro Glu Pro Tyr Val Val Val Asn Thr Ile Thr Lys Arg Phe Gln
85 90 95

gac cat aaa cta ttt gca aac aaa aat gct gat gtc tac gtt gaa aga 336
Asp His Lys Leu Phe Ala Asn Lys Asn Ala Asp Val Tyr Val Glu Arg
100 105 110

ctt caa aat gca att gcc tcg ggt att aag att cct gag tct aag aag 384
Leu Gln Asn Ala Ile Ala Ser Gly Ile Lys Ile Pro Glu Ser Lys Lys
115 120 125

aat gaa cga tta ggg cag cca aaa aag acg aaa aat gtt aca aaa gag 432
Asn Glu Arg Leu Gly Gln Pro Lys Lys Thr Lys Asn Val Thr Lys Glu
130 135 140

att gag gag acc ttt att gat gcc act aat gcg aga aaa gaa ttg gat 480
Ile Glu Glu Thr Phe Ile Asp Ala Thr Asn Ala Arg Lys Glu Leu Asp
145 150 155 160

gag tac ttc aga aaa ctt cag gat ggt aca tta acc gga gat ttg gag 528
Glu Tyr Phe Arg Lys Leu Gln Asp Gly Thr Leu Thr Gly Asp Leu Glu
165 170 175

ggt ggc ttg tgc aag gtc aaa acg ctc ata tcg tgt aaa gct ttg ttc 576
Gly Gly Leu Cys Lys Val Lys Thr Leu Ile Ser Cys Lys Ala Leu Phe
180 185 190

gaa gga cac acc caa gaa ctc cag ttt atg gcc acc aat gtt cgt aaa 624
Gly Gly His Thr Gln Glu Leu Gln Phe Met Ala Thr Asn Val Arg Lys
195 200 205

gtc tgg ata ggg gag ata gtg tgc ggc atg gtt tcc aat aaa aat gca 672
Val Trp Ile Gly Glu Ile Val Cys Gly Met Val Ser Asn Lys Asn Ala
210 215 220

att gac gat aat gat ctc gag gaa gaa gag cgt aat gca tcg ggc gaa 720
Ile Asp Asp Asn Asp Leu Glu Glu Glu Glu Arg Asn Ala Ser Gly Glu
225 230 235 240

caa act acg aca gcc cga gag gaa tca gag gct ctg gat acc aca tcc 768
Gln Thr Thr Thr Ala Arg Glu Glu Ser Glu Ala Leu Asp Thr Thr Ser
245 250 255

aat ggt ttg gac gct ctg aat act caa att aat gcc ata gaa acg gag 816
Asn Gly Leu Asp Ala Leu Asn Thr Gln Ile Asn Ala Ile Glu Thr Glu
260 265 270

gaa tca ttt tgg gaa gct atc agg gcg ctc cat aat gag cta cgc acc 864
Glu Ser Phe Trp Glu Ala Ile Arg Ala Leu His Asn Glu Leu Arg Thr
275 280 285

tct cca aca cag tta gaa gag tgc agg aaa gcg gca gtt ttt tta ctg 912
Ser Pro Thr Gln Leu Glu Glu Cys Arg Lys Ala Ala Val Phe Leu Leu
290 295 300

ggc cat aaa aaa ata ctc caa aca ttt aca aag caa aag gat act gcc 960
Gly His Lys Lys Ile Leu Gln Thr Phe Thr Lys Gln Lys Asp Thr Ala
305 310 315 320

cgc gct ctt ttt tat ata aat ctc aaa gag tgt ctg gga acc agc tgg 1008
Arg Ala Leu Phe Tyr Ile Asn Leu Lys Glu Cys Leu Gly Thr Ser Trp
325 330 335

aat tta gaa tat aca gag gca tca gat gca aga aaa atg gca att aaa 1056
Asn Leu Glu Tyr Thr Glu Ala Ser Asp Ala Arg Lys Met Ala Ile Lys
340 345 350

ggt gag ctt caa aat taa 1074
Gly Glu Leu Gln Asn
355

<210> 75 

<211> 357 

<212> PRT 

<213> Zygosaccharomyces bailii 



<400> 75

Met Phe Ser Arg Glu Glu Val Arg Ala Ser Arg Pro Thr Lys Glu Met
1 5 10 15

Lys Met Ile Phe Asp Val Leu Met Thr Phe Pro Tyr Phe Ala Val His
20 25 30

Val Pro Ser Lys Asn Ile Leu Ile Thr Pro Lys Gly Thr Val Glu Ile
35 40 45

Pro Glu Asn Tyr Gln Asn Tyr Pro Ile Leu Ala Ile Phe Tyr Val Lys
50 55 60

Tyr Leu Met Lys Lys Asn Pro Tyr Asp Leu Leu Pro Ser Thr Val Asn
65 70 75 80

Trp Pro Glu Pro Tyr Val Val Val Asn Thr Ile Thr Lys Arg Phe Gln
85 90 95

Asp His Lys Leu Phe Ala Asn Lys Asn Ala Asp Val Tyr Val Glu Arg
100 105 110

Leu Gln Asn Ala Ile Ala Ser Gly Ile Lys Ile Pro Glu Ser Lys Lys
115 120 125

Asn Glu Arg Leu Gly Gln Pro Lys Lys Thr Lys Asn Val Thr Lys Glu
130 135 140

Ile Glu Glu Thr Phe Ile Asp Ala Thr Asn Ala Arg Lys Glu Leu Asp
145 150 155 160

Glu Tyr Phe Arg Lys Leu Gln Asp Gly Thr Leu Thr Gly Asp Leu Glu
165 170 175

Gly Gly Leu Cys Lys Val Lys Thr Leu Ile Ser Cys Lys Ala Leu Phe
180 185 190

Gly Gly His Thr Gln Glu Leu Gln Phe Met Ala Thr Asn Val Arg Lys
195 200 205

Val Trp Ile Gly Glu Ile Val Cys Gly Met Val Ser Asn Lys Asn Ala
210 215 220

Ile Asp Asp Asn Asp Leu Glu Glu Glu Glu Arg Asn Ala Ser Gly Glu
225 230 235 240

Gln Thr Thr Thr Ala Arg Glu Glu Ser Glu Ala Leu Asp Thr Thr Ser
245 250 255

Asn Gly Leu Asp Ala Leu Asn Thr Gln Ile Asn Ala Ile Glu Thr Glu
260 265 270

Glu Ser Phe Trp Glu Ala Ile Arg Ala Leu His Asn Glu Leu Arg Thr
275 280 285

Ser Pro Thr Gln Leu Glu Glu Cys Arg Lys Ala Ala Val Phe Leu Leu
290 295 300

Gly His Lys Lys Ile Leu Gln Thr Phe Thr Lys Gln Lys Asp Thr Ala
305 310 315 320

Arg Ala Leu Phe Tyr Ile Asn Leu Lys Glu Cys Leu Gly Thr Ser Trp
325 330 335

Asn Leu Glu Tyr Thr Glu Ala Ser Asp Ala Arg Lys Met Ala Ile Lys
340 345 350

Gly Glu Leu Gln Asn
355

<210> 76 

<211> 750 

<212> DNA 

<213> Zygosaccharomyces bailii
<220>

<221> CDS

<222> (1)..(750)

<223>

<400> 76

atg aac tca gag ttc tct ctt gcg tac gga aat gtg gac tct gat tat 48
Met Asn Ser Glu Phe Ser Leu Ala Tyr Gly Asn Val Asp Ser Asp Tyr
1 5 10 15

gct ctc gat ctg ctt gaa cgg ctt gat tcc aac tgg aag ggc acc gag 96
Ala Leu Asp Leu Leu Glu Arg Leu Asp Ser Asn Trp Lys Gly Thr Glu
20 25 30

ctc ttt act cat ata cgc gaa acc ttt caa att ggc ctt ggc aat gtt 144
Leu Phe Thr His Ile Arg Glu Thr Phe Gln Ile Gly Leu Gly Asn Val
35 40 45

atc ata gtg tca gaa cag agt gaa agc ctt aga ata ccc cct tca cta 192
Ile Ile Val Ser Glu Gln Ser Glu Ser Leu Arg Ile Pro Pro Ser Leu
50 55 60

ctt ggt agc agt agt cca gca gat agc gac aat agt cct cca gga aca 240
Leu Gly Ser Ser Ser Pro Ala Asp Ser Asp Asn Ser Pro Pro Gly Thr
65 70 75 80

cct act aat gaa gcg caa ccc tgg ttt att tct gaa gat ctc tcg aaa 288
Pro Thr Asn Glu Ala Gln Pro Trp Phe Ile Ser Glu Asp Leu Ser Lys
85 90 95

ggc cct ttc acg gaa gcc cag tca act caa tca tct att gag aca ctc 336
Gly Pro Phe Thr Glu Ala Gln Ser Thr Gln Ser Ser Ile Glu Thr Leu
100 105 110

gaa ggt gag cac cat gct gtg tct tct ctg cac ctg aag cta aat ggc 384
Glu Gly Glu His His Ala Val Ser Ser Leu His Leu Lys Leu Asn Gly
115 120 125

ctc tcc tgt att gga cgt gct gta tgg cgg gct act cgc aaa atg gat 432
Leu Ser Cys Ile Gly Arg Ala Val Trp Arg Ala Thr Arg Lys Met Asp
130 135 140

acg aga aca gag gtg gac gac ata tta aac tca ata aca gaa ccc aga 480
Thr Arg Thr Glu Val Asp Asp Ile Leu Asn Ser Ile Thr Glu Pro Arg
145 150 155 160

aga ctc aca tta ccc ggt atc aac aag atg cgt caa tgc att gtg cgt 528
Arg Leu Thr Leu Pro Gly Ile Asn Lys Met Arg Gln Cys Ile Val Arg
165 170 175

cta ttg ctt ctc gta ccg atc caa gta cga gaa gag atc ctt tct ttc 576
Leu Leu Leu Leu Val Pro Ile Gln Val Arg Glu Glu Ile Leu Ser Phe
180 185 190

gcc ata gct tcg ggg ata ccc tca gaa aca ata gaa gat att cga tct 624
Ala Ile Ala Ser Gly Ile Pro Ser Glu Thr Ile Glu Asp Ile Arg Ser
195 200 205

tca aca aat att tca gct gtt gat acc aat ggc aga ggc ata gca cat 672
Ser Thr Asn Ile Ser Ala Val Asp Thr Asn Gly Arg Gly Ile Ala His
210 215 220

aat tcc aaa aag cgg tca tta gcg cca aca caa gat tca cgc aat tta 720
Asn Ser Lys Lys Arg Ser Leu Ala Pro Thr Gln Asp Ser Arg Asn Leu
225 230 235 240

cgc cgt cga atc agg gga cat acc caa tag 750
Arg Arg Arg Ile Arg Gly His Thr Gln
245

<210> 77 
<211> 249 
<212> PRT 

<213> Zygosaccharomyces bailii

<400> 77

Met Asn Ser Glu Phe Ser Leu Ala Tyr Gly Asn Val Asp Ser Asp Tyr
1 5 10 15

Ala Leu Asp Leu Leu Glu Arg Leu Asp Ser Asn Trp Lys Gly Thr Glu
20 25 30

Leu Phe Thr His Ile Arg Glu Thr Phe Gln Ile Gly Leu Gly Asn Val
35 40 45

Ile Ile Val Ser Glu Gln Ser Glu Ser Leu Arg Ile Pro Pro Ser Leu
50 55 60

Leu Gly Ser Ser Ser Pro Ala Asp Ser Asp Asn Ser Pro Pro Gly Thr
65 70 75 80

Pro Thr Asn Glu Ala Gln Pro Trp Phe Ile Ser Glu Asp Leu Ser Lys
85 90 95

Gly Pro Phe Thr Glu Ala Gln Ser Thr Gln Ser Ser Ile Glu Thr Leu
100 105 110

Glu Gly Glu His His Ala Val Ser Ser Leu His Leu Lys Leu Asn Gly
115 120 125

Leu Ser Cys Ile Gly Arg Ala Val Trp Arg Ala Thr Arg Lys Met Asp
130 135 140

Thr Arg Thr Glu Val Asp Asp Ile Leu Asn Ser Ile Thr Glu Pro Arg
145 150 155 160

Arg Leu Thr Leu Pro Gly Ile Asn Lys Met Arg Gln Cys Ile Val Arg
165 170 175

Leu Leu Leu Leu Val Pro Ile Gln Val Arg Glu Glu Ile Leu Ser Phe
180 185 190

Ala Ile Ala Ser Gly Ile Pro Ser Glu Thr Ile Glu Asp Ile Arg Ser
195 200 205

Ser Thr Asn Ile Ser Ala Val Asp Thr Asn Gly Arg Gly Ile Ala His
210 215 220

Asn Ser Lys Lys Arg Ser Leu Ala Pro Thr Gln Asp Ser Arg Asn Leu
225 230 235 240

Arg Arg Arg Ile Arg Gly His Thr Gln
245

<210> 78 
<211> 453 
<212> DNA 

<213> Saccharomyces cerevisiae 
<220> 

<221> promoter 
<222> (1)..(450)
<223> 

<220> 


<221> misc_feature 
<222> (451)..(453)
<223> start codon 



<400> 78 



ctacttattc ccttcgagat tatatctagg aacccatcag gttggtggaa gattacccgt 60
tctaagactt ttcagcttcc tctattgatg ttacacctgg acaccccttt tctggcatcc 120
agtttttaat cttcagtggc atgtgagatt ctccgaaatt aattaaagca atcacacaat 180
tctctcggat accacctcgg ttgaaactga caggtggttt gttacgcatg ctaatgcaaa 240
ggagcctata tacctttggc tcggctgctg taacagggaa tataaagggc agcataattt 300
aggagtttag tgaacttgca acatttacta ttttcccttc ttacgtaaat atttttcttt 360
ttaattctaa atcaatcttt ttcaattttt tgtttgtatt cttttcttgc ttaaatctat 420
aactacaaaa aacacataca taaactaaaa atg 453



<210> 79
<211> 499
<212> DNA
<213> Zygosaccharomyces bailii

<220>
<221> promoter
<222> (1)..(496)
<223>

<220>
<221> misc_feature
<222> (497)..(499)
<223> start codon



<400> 79 
ggatcgtatt gcttccattc ttcttttgtt attcggcgcg attcgaattc atgacatctt 60
ttaaccgtcc gcactacatt actggctcaa gaaaggattg ataaatacta ccaaggaaca 120
cgtgtatcca tttgatactg tgctggttac aagacacatg ctttacaagc acacttctat 180
ctctctcgac tgaggcgaaa cgtcgagtgg tttgatatca aatgcatgcg tgatatgcac 240
cattattttt cccttttact tccgtcacgc cggggctcca cttttttggg ttccactttt 300
cttacgaccc tcgacatcca ctaaacgaac aggaagtcaa agaacccctc gagtcacacg 360
gtgcgtatgc gctgttaaca tatataaagg tcacctttcc ctgctcaaaa gagtcttagc 420
aggctgttaa cttcactctc tatcgatcca tagaatctaa ctaacaagag actacatcgg 480
tataacaaat aacaaaatg 499



<210> 80 

<211> 27 

<212> DNA 

<213> artificial sequence 
<220> 

<223> PCR primer 

<400> 80 

aagagactcc aacgtcgcgc acctgta 27 

<210> 81 
<211> 32
<212> DNA 

<213> artificial sequence 






<220> 

<223> PCR primer 
<400> 81 

agaggattag gaagacacaa attgcatggt ga 32



<210> 82 

<211> 29 

<212> DNA 

<213> artificial sequence 



<220> 

<223> PCR primer 
<400> 82 

atcgtattgc ttccattctt cttttgtta 29



<210> 83 

<211> 29 

<212> DNA 

<213> artificial sequence 



<220> 

<223> PCR primer 
<400> 83 

tttgttattt gttataccga tgtagtctc 29



<210> 84 

<211> 27 

<212> DNA 

<213> artificial sequence 



<220> 

<223> PCR primer 
<400> 84 

tagctactct tctccaggtg tcattag 27

<210> 85
<211> 25 
<212> DNA 

<213> artificial sequence 
<220> 

<223> PCR primer 
<400> 85 

cctatgtccg agtttagcga gcttg 25 

<210> 86 

<211> 25 

<212> DNA 

<213> artificial sequence 
<220> 

<223> PCR primer 

<400> 86 

agaatgaact cagagttctc tcttg 25 

<210> 87 

<211> 22 

<212> DNA 

<213> artificial sequence 
<220> 

<223> PCR primer 

<400> 87 

attctattgg gtatgtcccc tg 22 

<210> 88 

<211> 30 

<212> DNA 

<213> artificial sequence 



<220> 
<223> PCR primer
<400> 88 

gtttttaatt ttgaagctca cctttaattg 30 

<210> 89 

<211> 26 

<212> DNA 

<213> artificial sequence 
<220> 

<223> PCR primer 

<400> 89 

attatgttct ccagggaaga ggttag 26 

<210> 90 

<211> 27 

<212> DNA 

<213> artificial sequence 
<220> 

<223> PCR primer 

<400> 90 

agaatcaatc atttagtgtg gcaggag 27

<210> 91 

<211> 25 

<212> DNA 

<213> artificial sequence 
<220> 

<223> PCR primer

<400> 91 

taaaaactgc ccgccatatt tcgtc 25 

<210> 92 

<211> 708 
<212> DNA 

<213> Zygosaccharomyces rouxii 
<400> 92

ctgcagaaag ccctaagatg ctcctcccgt tcacatgctc cgaacccttt ggaaaattct 60
gtgcgcggcg ctagcacgta atgacccttg atgacaaact ccaatggtat caccctactg 120
tcctctcccc ctcccctttt tttccttctt tctttccatc tatttctgat ctcctcccct 180
cagcagatgt cccgaaaggt acagctgcga tacgggcagc cactttttga cgtctcgcaa 240
caggatcacc ctgcacgacg gggcacaata ggattcccgt tggcacggtg ctggtgtata 300
gccgccgagg gtggggtata aagggctaca tccttacccc cacgcaggcg ataacccgca 360
tcatacaact gtcctcctct tccgctctcg ccactagccg ccgaaccatt gctaccgcaa 420
tgacaccgtg tggtgatctc aagggaggat gtgtgggtgt gggacggaac ttccactttt 480
tcctcagtag gtgcgatgcc ccctacaccg agcttccact aacgtgtttc agcggttgaa 540
ggcaatggga tcgcagaatt atcgcagctt gttggtatat aaagggagaa gatatatgga 600
taagagacat gttctacttc tgttcrctct ttctttttat cctatatcac cagaacaaat 660
caagttcgca ttgattcata tcaaataaaa agtacatcac agataaca 708



<210> 93 

<211> 21 

<212> DNA 

<213> artificial sequence



<220> 

<223> PCR primer 
<400> 93 

TGCAGAAAGC CCTAAGATGC T 21



<210> 94 

<211> 29 

<212> DNA 

<213> artificial sequence 



<220> 

<223> PCR primer 
<400> 94 

TGTCTGTGAT GTACTTTTTA TTTGATATG 29



<210> 95 

<211> 25 

<212> DNA 

<213> artificial sequence 



<220> 

<223> PCR primer 
<400> 95 

ACGCAAGAGA GAACTCTGAG TTCAT 25